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Determine  simple  methods  for  rating  noise  in  ship¬ 
board  spaces  in  relation  to  its  interference  with  speech 
communication. 


RESULTS 

1.  Representative  samples  of  ship,  office,  and  shop 
noises  were  recorded,  measured,  and  analyzed. 

2.  Naval  ship  spaces  tend  to  be  noisier  than  civilian 
spaces  where  equivalent  communicating  jobs  are  performed. 

3.  Sixteen  noises  were  selected  out  and  adjusted  in 
intensity  to  be  equally  speech-interfering.  Simple  phys¬ 
ical  measurement  and  calculations  on  these  16  equally 
speech-interfering  noises  showed  that  Speech  Interference 
(SI)  could  be  measured: 

a.  best  by  a  '"aging  the  Sound  Pressure  Levels 
(SPL)  in  mid-frequenc,  octaves  (300  to  600,  600  to  1200, 
1200  to  2400  cycles  per  second  (c/s)),  called  the  Speech 
Interference  Level  (SIL)  method; 

b.  next  best  by  using  weighting  networks  A  or  Din 
3  in  Sound  Level  Meters,  or  by  finding  the  SIL  (averaging 
the  SPL‘s)  in  the  octaves  from  300  or  600  to  4800  c/s. 

c.  least  well  by  fitting  spectral  noise  peaks  to 
Noise  Criterion  rating  curve  contours  of  which  the  Noise 
Criterion  Alternate  (NCA)  was  better  than  the  conventional 
Noise  Criterion  (NC)  or  International  Standards  Organiza¬ 
tion  (ISO)  contours. 

4.  More  complex  physical  measurement  or  calculation 
on  the  16  equally  speech-interfering  noises  showed  that 
Speech  Interference  could  be  measured  well  by  Articula¬ 
tion  Index  (AI)  methods.  Simpler  5-  and  6-octave  methods 
employing  a  generalized  speech  spectrum  were  almost  as 
good  as  the  more  elaborate  20-band  method  using  the  actual 
speech  spectrum  utilized  in  this  experiment. 

5.  Speech  Interference  could  be  predicted  well  by  using 
families  of  NC,  NCA,  or  ISO  curves  if : 
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a.  only  that  part  of  any  of  the  curves  that  centered 
at  500,  1000,  and  2000  c/s  were  used; 

b.  the  curves  "averaged  through"  spectral  peaks 
and  valleys  of  the  noise  spectra. 

6.  For  speech  in  quiet,  half  the  intelligibijity  lies  in 
frequencies  above  and  below  some  value  at  or  between  1600 
and  1900  c/s.  As  the  ratio  of  speech  to  noise  inter  sity 
deteriorates,  the  frequency  that  divides  the  speech  spectrum 
into  two  equal  halves  (as  regards  contribution  to  speech 
intelligibility)  drops  from  1600  or  1900  c/s  to  abott  800  or 
1000  c/s. 

7.  A  new  Speech  Interference  (SI)  noise  rating  contour 
was  developed  that  could  be  used  in  any  of  the  conventional 
ways  of  measuring  fjpeech  Interference,  namely: 

a.  to  estimate  the  SIL; 

b.  as  a  weighting  network  for  a  Sound  Level  Meter; 

c.  as  a  noise -rating  contour. 

8.  The  new  SI  contours  rated  the  Speech  Interference 
effects  of  the  18  noises  as  good  or  better  than  any  previous 
method,  and  in  addition  resolved  many  of  the  extreme  dif¬ 
ferences  among  the  three  speech  interference  rating  methods. 

9.  The  new  Speech  Interference  Contours  are  not  dras¬ 
tically  different  from  theoretical  extensions  of  the  AI  cal¬ 
culation  method. 

10.  Thermal  noises  (TN)  witl  spectra  shapes  of  -12, 

-6,  and  +6  dB  per  octave  (TN-12,  TN-6,  TN  Flat,  and 
TN-H5)  are  representative  of  the  steady-state  noises  in  the 
original  16  noises. 

11.  The  16  equally  speech-interfering  noises  were 
neither  equally  loud  nor  equally  annoying. 

12.  Maximum  noise  level  for  face-to-face  communica¬ 
tion  is  a  500/1000/2000-c/s  SIL  of  95  dB. 

13.  Maximum  500/1000/2000  SIL-  for  speech  communi¬ 
cation  when  using  good  "noise-proofed,  "  sound-powered- 
phones  is: 


a.  84  dB  u  the  talker  is  in  the  quiet  and  the  listen¬ 
er  is  in  noise; 

b.  94  dB  if  both  the  talker  and  listener  are  in  noise; 

c.  114  dB  if  the  talker  is  in  noise,  the  listener  in 

quiet. 

14.  Amplified  speech  communication  with  earphones  is 
possible  in  a  500/1000/2000-c/s  SIL  of  120  dB  if  use  is  made 
of  noise-cancelling  dynamic  or  condenser  microphones, 
noise  shielding  at  mouth  and  ear,  a  speech  bandwidth  of 
three  octaves  or  greater  centered  between  1000  and  1800  c/s, 
a  low  sidetone  level,  AVC,  and  peak  clipping. 

15.  Amplified  speech  communication  with  loudspeakers 
is  possible  in  500/1000/2000-c/s  SIL  noise  levels  of  80  dB. 

If  earplugs  or  passive  earmuffs  are  worn,  this  level  can  be 
extended  to  95  dB. 


RECOMMENDATIONS 


1 .  In  using  SIL  methods  to  rate  noises,  average  the 
sound  pressure  levels  in  the  octaves  centered  at  500,  1.000, 
and  2000  c/s.  This  procedure  is  a  compromise  between 
the  presently  used  600  to  4800-c/s  range  and  the  range 
found  to  be  best  in  this  study,  the  300  to  2400-c/s  range. 

2.  Use  the  newly  developed  SI  contours  as; 

a.  a  new  weighting  method  for  Sound  Level  Meters; 

b.  extensions  of  the  existing  noise -rating  contours. 


3.  Use  o?  the  SI  contours  should  be  evaluated  in  work¬ 
ing  ships'  spaces. 

4.  Loudness  and  annoyance  aspects  of  noises  should  be 
considered  in  future  extensions  of  this  work. 

5.  Spaces  where  conversations  cannot  be  carried  on  in 
comfort  at  3  feet  are  too  noisy  for  tasks  requiring  face-to- 
face  communications.  In  general  this  is  when  the  average 
noise  level  In  the  octaves,  centering  at  500,  1000,  and 
2000  c/s  (the  500/1000/2000-c/s  SIL),  exceeds  70  dB. 

6.  If  the  500/1000/2000-c/s  SIL  exceeds  90  dB,  the 
wearing  of  hearing  protection  should  be  mandatory. 
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PREFACE 


This  report  consists  of  seven  reprints  of  papers  in 
professional  journals  plus  collateral  and  supplementary 
material.  Because  of  this  there  will  be  some  duplication, 
and  the  historical  and/or  logical  order  will  not  always  be 
followed  strictly. 

The  problem  in  terms  of  a  real  naval  environmental 
control  problem  is  stated  in  Section  I  (Introduction).  The 
evidence  that  the  problem  really  exists  is  presented  in  Sec¬ 
tion  II  (a  summary  of  noise  surveys  aboard  a  number  of 
ships).  Sections  HI,  IV,  and  V  are  reprints  of  papers  deal¬ 
ing  with  two  experiments  on  the  physical  and  speech-inter¬ 
fering  properties  of  diverse  spectrum  noises. 

Section  VI  gives  some  details  on  psychophysical 
measurement  methods  for  noises.1*  Section  VB,  which 
has  been  submitted  for  publication,  and  Sections  VIII  and 
IX  (reprints),  propose  new  ways  of  measuring  the  speech- 
interfering  properties  of  noise. 

Section  X  shows  the  important  frequency  regions  in 
noise-masked  speech  in  terms  of  where  in  the  speech  im¬ 
pairment-handicap-disability  scale  the  criterion  is  chosen. 

Section  XI  summarizes  speech  capabilities  in  noise 
and  is  based  primarily  on  evaluations  of  the  speech  intel¬ 
ligibility  of  numerous  communication  systems  and  compo¬ 
nents. 


*  See  list  of  references  at  end  of  report. 
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IHTRODUCTirN 


The  Bureau  of  Ships  has  specifications3’3  limiting 
the  levels  of  noise  in  various  ship  spaces  and  for  emission 
of  noise  by  equipment.  These  specifications  are  based  on 
considerations  of  the  effects  of  noise  on  communications 
and  on  the  potential  deafening  effects. 

This  report  reexamines  the  question  of  speech  inter¬ 
ference  effects  of  noise  with  a  view  toward  simplifying  the 
noise  level  measurement  procedures.  To  accomplish  these 
tasks:  (1)  noise  levels  in  a  number  of  shipboard  spaces 
were  measured  aboard  operating  ships  (and  reviewed  from 
other  shipboard  measurements)  (Section  II);  (2)  represent¬ 
ative  aircraft,  ship,  machinery,  and  office  noises  were 
collected  (Section  III);  (3)  12  of  these  noises  plus  four  lab¬ 
oratory-generated  noises  were  then  spectrum-analyzed 
(Section  III);  (4)  intelligibility  tests  were  conducted  to  find 
the  levels  at  which  these  16  noises  interfered  equally  with 
speech  (Section  III);  (5)  Ahe  16  noises  were  subjected  to  a 
number  of  simple  analyucal  physical  tests  and  measure¬ 
ments  to  find  a  physical  measurement  which  agreed  that  the 
noises  were  equally  speech-interfering  (Section  III);  (6)  the 
variability  among  estimations  of  the  average  noise  level 
obtained  from  observers  reading  moving  coil  meters  was 
determined  (Section  IV);  (7)  complex  calculation  schemes 
based  on  physical  measures  were  applied  to  the  16  noises 
(Section  IV);  (8)  psychophysical  measurement  schemes  were 
applied  (Section  VI);  and  (9)  a  new  set  of  speech  Interference 
Criteria  was  developed  (Sections  VII,  VIII,  and  IX). 

On  the  basis  of  this  reexamination,  two  summary 
papers  were  compiled  that  concern  the  important  frequencies 
in  noise-masked  speech  (Section  X)  and  noise  limitations  on 
speech  (Section  XI). 

Related  studies  were  conducted  to  determine  the 
relationships  between  (1)  noise  levels  and  speech  levels,4 
and  (2)  the  angle  between  talkers  and  listeners  in  face-to- 
face  communications. 5,6  These  are  presented  in  Appendixes 
A,  B,  and  C. 
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MEASUREMENT  OF  SHIP  NOISES 


To  obtain  experience  with  the  kinds  of  ieveis  of 
noise  aboard  ships,  noise  levels  were  measured  on  three 
Navy  vessels,  the  aircraft  carriers  USS  ORISKANY  (CVA  34) 
and  USS  TICONDEROGA  (CVA  14),  and  the  missile  cruiser 
USS  CANBERRA  (CAG  2).  Measurements  were  made 
during  sea  trips  taken  in  conjunction  with  two  other  problems 
dealing  with  communications  in  the  Combat  Information 
Center  (CIC)  and  on  the  flight  deck  of  aircraft  carriers.  To 
some  degree,  therefore,  the  sampling  tended  to  be  concen¬ 
trated  in  areas  occupied  by  CIC  and  Air  Department  person¬ 
nel. 

Figures  II— 1  and  II-2  summarize,  in  histogram  form, 
157  noise  levels  measured  aboard  the  three  ships.  Except 
for  a  few  large  compartments,  each  measurement  represents 
a  single  compartment.  The  uppermost  histogram  of  figure 
II- 1  shows  the  distribution  of  overall  or  C  scale  levels. 

All  measurements  were  made  with  calibrated  General  Radio 
Company  Type  1551-Aor  1551 -B  Sound  Level  Meters.  The 
arrow  at  86  dB  indicates  the  median  level.  The  next  histo¬ 
gram  depicts  sound  levels  obtained  with  the  A  weighting  7  of 
the  sound-level  meter  for  the  same  set  of  15?  measurements. 
For  this  distribution  the  median  is  76  dB. 

The  two  lower  histograms,  figure  II-2,  present  over¬ 
all  (C  scale)  and  A-weighting  levels  measured  in  spaces  in 
which  satisfactory  speech  communication  was  judged  by  the 
measurement  team  to  be  important.  The  64  measurements 
selected  from  the  157  measurements  correspond  roughly  to 
the  A  category  of  Ships  Specification  SI-10.2  The  median  C 
and  A  levels  for  these  spaces  are  82  and  70  dB,  respectively. 

Figure  U-3  presents  histograms  of  noise  levels 
measured  by  Jensen  and  Soroka  aboard  the  aircraft  carrier 
USS  CORAL  SEA  (CVA  43). 8  The  upper  histogram  is  based 
on  overall  levels  calculated  from  listed  octave  band  levels 
and  the  lower  histogram  is  based  on  Speech  Interference 
Levels9  of  the  same  measurements.  Median  levels  are  93 
and  73  for  the  two  distributions  of  60  measurements  each. 

Additional  data  on  noise  levels  aboard  U.  S.  Navy 
ships  are  available  in  reports  from  the  Material  Laboratory, 
New  York  Naval  Shipyard. 10  Median  overall  levels  were 
about  84  dB  for  44  measurements  on  USS  BORIE  (DD  704), 
about  90  dB  for  20  measurements  on  USS  TIMMERMAN 
(EAG  152),  and  about  80  dB  for  USS  TICONDEROGA  for 
19  measurements  taken  with  no  aircraft  in  operation. 

Speech  Interference  Levels  (the  300  to  4800-c/s, 
four-band  average)  gave  median  values  of  80  dB  for  20 
measurements  on  USS  TIMMERMAN  and  of  66  dB  for  20 
measurements  on  USS  TICONDEROGA. 


ALL  COmPARTiVXNTS 


Figure  II— 1 .  Distribution  of  noise 
levels  in  representative  compartments 
on  two  aircraft  carriers  and  one  heavy 
cruiser.  The  upper  histogram  is  mea¬ 
sured  with  the  C- ,  the  lower  with  the 
A-weighting,  of  a  sound  level  meter. 
Arrows  point  to  median  values. 
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Figure  II-2.  Distribution  of  noise 
levels  ir.  only  those  compartments  where 
speech  communications  were  judged  to  be 
necessary  on  two  aircraft  carriers  and 
one  heavy  cruiser.  The  upper  histogram 
is  measured  with  the  C- ,  the  lower  with 
the  A-weighting,  of  a  sound  level  meter. 
Arrows  point  to  median  values. 
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SPEECH  INTERFERENCE  LEVEL  IN  OB 


Figure  I I -3.  Distribution  of  noise 
levels  abccrd  US, S  CORAL  SEA  (CVA  43). 

The  upper  histogram  represents 
C  levels,  the  lower  is  the  average 
level  in  the  four  octaves  from  300  to 
4800  c/s,  the  fou: — octave  Speech 
Interference  Level  (SIL). 

Arrows  point  to  median  values. 

A  later  survey  of  shipboard  noises  by  the  Southwest 
Research  Institute11  concludes  ..  that  airborne  sound 
levels  generated  by  machinery  items  are  above  specifica¬ 
tions  in  several  shipboard  spaces." 

Figures  II-4  through  n-11  summarize  the  SRI 
noise  measurements  and  the  BUSHIPS  specifications  ap¬ 
plicable  at  the  time.  The  measurements  reported  were 
made  aboard  the  aircraft  carriers  USS  ENTERPRISE  (CVA 
65),  USS  FORRESTAL  (CVA  59),  USS  KITTY  HAWK  (CVA 
63),  USS  RANGER  (CVA  61),  and  USS  HANCOCK  (CVA  19) 
from  1959  to  1962;  on  various  conventional  and  nuclear 
submarines  in  1958  and  1963;  and  destroyers  in  1956  and 
1963.  The  authors  measured  noise  levels  only  aboard 
DD's  849  and  858  and  CVS  18  in  March  1964.  In  general 
the  noises  in  Category  D  and  E  spaces  were  measured  at 
full  power  runs  and  other  spaces  during  "endurance '*  run¬ 
ning  conditions. 

The  noise  levels  aboard  merchant  ships  of  the 
Netherlands13  and  Norway13,14  varied  from  65  dB(A)  or  95 
dB(C)  in  cabins  to  105  dB(A)  or  110  dB(C)  in  Engine  Rooms. 

Noise  levels  on  the  navigation  (pilot)  bridge  of  24 
German  ships15  varied  between  70  and  102  dB  at  31  c/s  and 
between  40  and  55  dB  at  200G  c/s,  with  spectra  falling  about 
10  dB  per  octave  from  31  to  250  c/s  and  about  5  dB  per 
octave  above  250  c/s.  On  the  average,  C  levels  were  90  dB 
at  "full  speed  ahead"  (voile  Fahrt)  and  80  dB  at  "stop. " 
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Figure  II-4 .  The  range  for 
three  Category  A  spaces  fair 
craft  carriers ).  SIL  limits 
CIC  -  50  dB;  Chart  Boon  and 
Radio  Control  **  60  dB. 
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SAMPLINGS  OF 
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Figure  11-6.  The  range  for 
three  Category  B  spaces 
( desi  ro  yersj  . 
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Figure  11-7.  The  range  for 
two  Category  C  spaces. 
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RANGE  Of  MEASURED  NOISE  LEVEES  IN  AN  ENGINE  ROOM 
MIL-S-l-lOFOS  THE  CATEGORY  D  SPACE 
PERCENTAGE  Of  READINGS  EXPECTED  TO  BE  ABOVE  AND 
BELOW  THE  MILITARY  SPECIFICATION 


OCTAVE  PASS  BANDS  OS 


Figure  1 I~S .  The  range  for 
a  Category  D  space  (conven¬ 
tional  submarine  -  full 
power  run). 


RANGE  Of  MEASURED  NOISE  LEVELS  IN  THE  ENGINE  ROOM 
MIL-S'1-10  ICATSGQRY  D) 

PERCENTAGE  OF  READINGS  EXPECTED  TO  BE  ABOVE  AND 
BELOW  THE  MILITARY  SPECIFICATION 
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Figure  11-9.  1  he  range  for 

a  Category  D  space  (nuclear 
submarine  -  full  power  run,'. 
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i-Jr  RANGE  Of  MEASURED', NOISE  LEVELS  IN  THE  ENGINE  ROOM 
DURING  FULL  POWER  RUNS 

-  MIL  - S- 1-10  (CATEGORY  0) 

-  PERCENTAGE  OF  READINGS  EXPECTED  TO  6E  ABOVE  AND 

8EIOW  THE  MILITARY  SPECIFICATION 
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Figure  11-10.  The  range 
for  a  Category  D  space 
(19  destroyers )  . 


3  RANGE  OF  NOISE  LEVELS  FOR  THE  FIRE  ROOM 

-  MIL-S- MO  'CATEGORY  D) 

-  PERCENTAGE  OF  READINGS  EXPECTED  TO  BE  ABOVE  AND 

Saw  THE  MILITARY  SPECIFICATION 
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Figure  11-11.  The  range 
for  a  Category  D  space 
(surro  ru)  . 


In  summary: 


1.  Overall  levels  on  three  aircraft  carriers,  a  missile 
cruiser,  and  two  destroyers  ranged  from  70  dB  to  over  120 
dB;  median  sound  pressure  levels  for  five  different  groups 
of  measurements  were  86,  93,  84,  90,  and  80  dB. 

2.  The  A-weighting  levels  taken  aboard  two  carriers 
and  a  cruiser  measured  from  54  to  116  dB  with  a  median 
value  of  76  dB.  Merchant  ship  values  ranged  from  65  to 
105  dB. 

3.  Speech  Interference  Levels  for  two  carriers  and 
one  destroyer  measured  from  55  dB  to  1 00  dB  with  median 
values  of  66,  80,  and  73  dB.  Ship  specifications  Section 
SI-102  give  maximum  permissible  SIL’s  of  50,  55,  and  60 
dB  for  various  sized  category  A  compartments  (in  which 
speech  communication  is  important)  and  72  dB  for  category 
E  compartments  (in  which  deafness  avoidance  is  a  consider¬ 
ation,  but  a  certain  amount  of  speech  communication  is 
necessary).  Obviously,  with  median  measured  SIL’s  of 

66,  80,  and  73  dB  and  estimated*  median  SIL's  of  about  66 
dB  and  62  dB  for  the  data  of  figure  n-1,  the  noise  level  in 
Navy  ships  is  high  enough  to  produce  speech  interference 
problems.  It  is  not  surprising  that  several  noise  surveys 
conclude  with  statements  to  the  effect  that  SI- 10  maximums 
were  exceeded.  9,1°’ 11 

It  should  be  noted  that  although  noise  is  a  problem 
in  the  Navy,  not  all  ships  or  compartments  can  be  classi¬ 
fied  as  noisy.  Brief  informal  observations  aboard  the  air¬ 
craft  carrier  USS  RANGER  (CVA  61),  in  June  and  August 
1962,  indicated  a  number  of  locations  that  were  relatively 
free  of  noise  (about  50  dB  on  the  C  scale  of  a  sound-survey 
meter).  The  RANGER  is  a  relatively  r  ew  ship  (commis¬ 
sioned  in  1957),  and  from  the  observed  widespread  use  of 
sound-absorbing  and  sound-isolating  material,  it  appears 
that  noise  may  be  a  lessened  problem  in  some  late  model 
ships. 


*  For  ship  noise  the  median  A  level  appears  to  be  about  10 
dB  below  the  median  C  level  and  the  median  four-band  STL 
about  10  dB  below  the  A  level  (fig.  II- i  and  11-2}.  This 
generalization  should  not  be  soul  led  to  individual  measure- 
ments.  but  probably  holds  true  for  the  general  class  of 
ship  noise  because  of  the  all-pervading  influence  of  low 
frequency  noise  from  blowers  and  propulsion  machinery. 
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The  sound  level  c$  each  of  16  diverse  coses  was  adjusted  so  that  Ioodspe2ker-iepTodocxd  rhyme  words 
at  a  level  of  78  dBiC)  at  one  meter  vrire  reduced  to  50%  void  intriEjpHEty.  Spectrum  analyses  (octave 
bond  and  2&qs  hand),  physical  measurements  (C.  B,  A.  DIN  3  sound-level  meter  weightings),  calculaliocs 
[Speech  Interference  Levels  (SIL)l,  and  peak  £tticgs  [to  noise  criteria  (XO-lype  curves j  mere  made  to 
find  which  method  came  closest  to  agreeing  that  the  noises  were  equally  speech-interfering. 

SIL  cakrcbtioQs  that  iadaded  the  octave  300-600  cps  predicted  the  speech-interference  aspects  of  the 
16  noises  best.  The  A  and  DIN  3  weighting  networks,  the  oonveatkoal  (600-4500  css)  SIL,  end  use  of  a 
restricted  terioa  (359-2HJ0  cps)  of  NC-type  curves  gate  the  nest  best  prediction.  The  poorest  predictor? 
were  the  B  cr  C  &?qxseaa--’=«ighling  networks  and  tse  of  the  XCA,  XC,  or  ISO  costoois  over  their  total 

frrqisency  range. 


DITIIODIJCTIOK 

OMPARED  to  most  civilian  tvoik  ate as,  the 
ambient  noise  levd  in  ship  compartments  is  high. 
Noise  in  ship  spaces  comes  from  a  variety  ei  sources 
such  as  blowers,  pomps,  generators,  public  address 
systems,  radio  rece3Yers,and  men.  With  the  ship  under¬ 
way,  noise  in  iower-krvd  compartments  is  augmented 
by  noise  from  the  propulsion  gear  and  from  hull  vibra¬ 
tion.  Noise  control  is  an  ever-present  problem  cm  ships. 
Noise  control  on  future  ships  is  apt  to  present  even 
greater  problems  dee  to  the  increased  raise  potential 
of  future  weapon  systems.  SunpSSed  naise-juting 
methods  be  needed. 

This  report  re-examines  tie  speech  interference  as¬ 
pects  c-i  muse,  sab  a  view  towards  slmpEfying  noose 
Vvd  measurement  pr-xedmes.  To  srro—phdi  this  - 
(1)  noise  kvds  in  a  number  of  shipboard  spaces  mere 
measured  aboard  operating  ships,  (2)  representative 
aircraft,  snip,  machinery,  and  oeFce  noises  were  ad¬ 
jected  and  spectrum  analysed,  (3)  intcfnglhmtT  tests 
wee  ccndc-cted  to  and  the  levels  at  which  16  of  these 
noises  interfered  equally  with  speech,  and  (4)  the  16 
smses  were  subjected  so  a  number  of  analytical  physical 
tests  and  measurements  to  and  a  physical  measurement 
that  agreed  that  the  noises  were  eqatSv  speech-inter¬ 


level  required  for  successful  commurucation  in  a  face-to- 
face  shaaOcn,1-  to  determine  the  variability  among 
estimations  of  the  average  noise  levd  obtained  from 
observers  reading  moving-coil  meters,3  and  to  find 
psychophysical  measurements  of  the  noises  that  cor¬ 
relate  with  speech  interference.4 

L  COLLECTION  OF  902SS  SAMPLES 

The  first  phase  of  the  study  to  determine  the  speech- 
interfering  properties  of  dim  noises  was  to  obtain  a 
representative  coDectko  of  recordings  of  such  noises. 
The  noise  found  aboard  an  aircraft  earner  intrudes 
many  of  the  noises  found  on  other  surface  ships  and  in 
addition  isdndes  some  peculiar  to  ah  craft.  For  this 
reason,  most  cs  ibe  ship  noise  samples  wee  collected  on 
aircraft  carriers.  Recordings  were  made  in  Irving  spaces, 
communication  spaces,  engineering  spaces,  and  on  or 


5  J.  C  Wt^wsad  R.  G.  Klsso,  *E5s»  rf  Asaset  Noise 
ra  XaAv  l alters  ca  a  Facr-to-Facr  Ccsswssacatsca  Task,*’ 
J.  AuxsL  See.  An.  K  936  (WS2). 

;R.G.Sjsoo3LH.Kh;ifflBCsa£aSl«dhorid 
a  Talks’  to  Ascs  the  Speech  lats&aeace  ec  A-sftpcfl  Nassc’" 
(to  he  peHEseo). 

*  it  G.  Dsso  aad  J.  L.  Lecascd,  Tfeseig  VraiSy  is 
Srrs.rr~:  Xrise  Levels  -=T£h  Meters,”  Sccsff — Its  Uses  od  Ccctrri! 
2,  No.  4. 25-29(1965). 

*S.  G.  Sss9  asd  J.  C  Webster,  ^PreScriss  Speech  I attr- 
fscacr  frees  Ursxxl  cd  gprfaAjgd  Measures  off  Asffks! 
Mrac"  J.  AcaaSL  See.  As.  S.  IlfS  (A)  (1963). 


firing.  Gojev  related  studies  were  coacsciea  to  dt- 
'enrhe  the  relationship  between  raise  krvri  and  speech 
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SPEECH-INTERFERING  NAVY  NOISES 


about  the  flight  deck  with  jet  and  propeller  aircraft 
operating.  For  a  ship’s  diesel  engine  noise  a  sample 
furnished  by  Liibcke1  was  used.  Other  recordings  were 
made  in  machine  shops  and  oflree  spaces  back  in  the 
laboratory.  To  complete  the  sampling,  three  standard 
laboratory  noises  were  added:  random  noise,  flat  or 
unweighted ;  random  noise  shaped  at  -f  6  dB  per  octave; 
and  random  noise  shaped  at  —6  dB  per  octave.  These 
are  later  identified  as  TX  Fiat,  TN-f-6,  and  TX— 6. 

Equipment  used  to  record  noise  samples  included  a 
selected,  omnidirectional  Altec  Lansing  type  633A  mi- 


K>M  MU  luU  «•  null  HUM 


Fig.  I.  Octave  band  and  3kjoc  bud  ievdk  os"  zxese  ssspk  1 
{sip's  nnabSe)-Tljis  is  a  iaHr  scah  ,  vGj  -Joas.tHpcxT  rmahfc. 
A  15-sec  log  aas&ode  vs  time  trace  is  samra  at  tie  gipcr 
Tbr  over-aS  *C”  fcvd  is  satoe  at  fc  enraac  kit.  A:  the 
t*ss«a  aie  sstes  rt|seststsi  tic  octave  iasd  ktxs  in  the 
ectaves  150-500.  SOO-MO.  600-1200.  1200-2400.  24!#-4SOa.  =ne 
4Srt>-9fiaa  qe.  Le_  these  astert  on  she  series  212, 425.  etc.  The 
octave  bod  tss  20<vde  bsud  cata  are  oot  directly  ocgsabfe 
sac;  these  arr  icasszk  rakses  made  css  are  «E5etret  pintod 


oophorc,  a  General  Radio  Company  type  1551A 
sound-level  meter,  and  an  A  rapes  Corporation  type  ©1 
recorder.  In  recording  noise  samples,  the  microphone 
*a«  positioned  at  places  normally  occupied  ire  people 
performing  their  jobs. 

Of  150  recordings,  16  samples  were  selected  to 
represent  both  intermittent  and  steady-state  noises,  to 
incfcde  both  nigh-  and  km -frequency  noises,  and  to 
induce  a  number  of  noises  commonly  found  aboard 
operating  ships.  A  short  description,  together  with  leg 
amplitude  vs  time:  trace  3nd  a  spectral  analysis,  is  shown 
for  each  noise  in  f-igs.  1  through  16- 


*  Eess  Lajclre.  'Txie-Msdssssesa.  1955-3CS”  fjcivate 


octm  Fin  11*3$  •  dun  n*  t:c>c 


Fw.  2.  Octave  band  and  20-cycie  Land  levels  of  noise  samirie 
(jyab  c-npinc)-  This  is  an  electrical  n»l«*r  of  iarec  size  with  slncsi 
kw-freqaatr  haslit  nawnaits.  See  Re.  1  for  other  details. 


1L  ANALYSIS  OF  THE  10  NOISES 

The  magnetic  tape  recordings  of  the  noise  samples 
were  cut  into  loops  of  10  to  15  sec  duration  and  re¬ 
produced  ever  an  Ampex  Corporation  type  401  re¬ 
corder.  The  signal  from  the  recorder,  amplified  by  t  xo 
20-W  and  one  30-W  McIntosh  Corporation  amplifiers, 
was  fed  to  an  Acoustic  Research  loudspeaker,  model  3, 

zc~-m*t  m;j  j*  H-.U  tust 


rRECWsSCr  ssi  CTC5.ES  PE*  SECOW 

Ik.  i  Octave  band  and  2kvde  o-nd  levels  cs  noise  srrrqde  5 
{Uwj).  Tins  is  a  straw,  ira -tnrrzrzKy  Uww  aasr.  See  Fic.  I 
fee  caber  deSask. 


(#r»  K> 


G .  KLUMPP  AND  j .  C .  WEBSTER 


1330 


IBIMSRI 


3CT«*C  (ft  CrZi.lt  ft*  St 

—  -  m  —  »*» 


\{/ 


?  J  i  |  er 


FREOUEtfCY  tM  CYCLES  PER  SECOSW) 

Fro.  4.  Octave  hand  and  20-o  clc  band  levels  of  noise  sample  4 
(thermal  noise  sloped  ar.  minus  6  dB  per  octave).  This  is  very 
steady,  lew-frequency  ntisc-  See  Fig.  1  for  other  details. 


FREQUENCY  1*  CYCLES  PER  SECOWJ 

Ftc.  6.  Octave  band  and  20-cyde  band  1  .els  o?  noise  sample  6 
(power  shear),  an  extremely  intermittent  noise  of  a  hydraulic 
power  shear  cutting  metal,  including  clang  of  sheet  metal  dropping 
to  floor.  See  :z.  1  for  other  details. 


and  an  Altec  Lansing  604  loudspeaker  system  on  one 
ade  and  to  an  Altec  Lansing  820  loudspeaker  18  feet 
away  on  the  other  side  of  t  studio.  The  dimensions  of 
the  studio  were  27  ftX  16  fix  10  ft,  and  its  reverberation 
time  was  about  0.3  sec  at  frequencies  up  to  1  kc,  sec, 
rising  to  about  0.5  sec  at  4  kc  sec.  For  analysis,  the 
acoustic  signal  was  picked  up  by  a  condenser  micro¬ 


phone  located  midway  between  the  two  loudspeaker 
systems. 

After  the  speech-interference  tests  were  completed, 
octave  band  and  other  physical  measurements  of  the 
acoustic  reproduction  of  cadi  of  the  16  recorded  samples 
were  made.  A  Bniel  and  Kjaer  type  2 <03  RMS  meter 
and  a  pair  of  Allison  filters  with  a  coir.bii.ed  rejection  of 
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FRECySJjrr  CYCLES  PER  SECOND 

Fig  5.  Octave  band  and  20-cydc  band  levels  of  noise  sample  5 
•blower  and  hall),  a  fairly  steady,  krw  frequency  ramble  and 
•srachiije  noise.  See  Fig.  1  for  other  details. 


FREQUENCY  I*t  CYCLES  PER  SECOND 

Fro.  7.  Octave  hand  and  20-cyck  b\nd  levels  of  noise  samcJc 
(gem  rater),  which  has  whine  ana  law  -frequency  casmtoec: 
with  small  wgula  changes  in  level.  See  Fig.  1  for  other  ocJa3s 
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FREQUENCY  IN  CrCl.ES  PER  SECOND 

Fig.  8.  Octave  band  and  2C-cycie  band  levels  of  noise  sample  8 
(a  compress^.  with  rhythmic  sound  pattern).  See  Fig.  1  for  other 
details. 

60  dB/ octave  were  used  to  make  these  measurements. 
The  octave  band  measurements  are  presented  in  Figs.  1 
through  16,  together  with  the  log  amplitude  vs  time 
traces.  A  20-cps-band  analysis,  made  before  the  final 
loudspeaker  configuration  was  determined,  is  also  given 
m  these  same  figures  for  reference,  although  these  data 
are  not  directly  comparable  to  the  octave  band  data. 

ocim  Hsi  »anc>i  m  cyclcs  »r.«  iccono 
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FREOUENCY  IN  CYCLES  PER  SECOND 

Fig.  9.  Octave  band  and  20-cycle  nand  lev.  S  of  noise  cample  9 
(voice  babble  from  Ref.  1).  This  consisted  of .  pairs  of  communi- 
catorsexdiMgiiig  monosyllabic  words,  resulting  in  irregular  level 
(20  dB)  and  frequency  changes.  Sec  Fig.  1  for  other  details. 


FREQUENCY  IN  CYCLES  PER  SECOND 

Fig.  10.  Octave  band  and  20-cvcle  band  levels  of  noise  sample 
10,  a  relatively  flat,  relatively  constant  level  thermal  noise  with  a 
wide-band  hiss,  relatively  constant  in  level.  See  Fig.  1  for  other 
details. 

The  over-all  levels  (C  weighting)  on  Figs,  i  through  16 
represent  equally  speech-interfering  levels;  the  deter¬ 
mination  of  and  the  interpretation  of  these  over-all 
levels  are  discussed  later. 

The  noise  samples  were  numbered  on  the  basis  of 
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FREOUENCY  IN  CYCLES  PER  SECOND 

hio.  ll.  Octave  band  and  20-cyclc  band  level  of  noise  sample  11 
(arresting  gear).  This  noise,  which  was  recorded  in  an  uiresting- 
gca.  room  of  a  carrier,  has  rumble,  shriek,  bang,  and  roar  com¬ 
ponents,  and  fluctuates  over  a  20-dB  range  in  an  irregular  fashion. 
See  Fig.  1  for  other  details. 
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Fro.  12.  Octave  band  and  20-cycle  band  levels  of  noise  sample  12 
(engine  room).  rh:s  complex  machinery  noise,  furnished  by 
Liibcke,  is  relatively  steady  in  level.  Sec  Fig.  1  for  other  details. 


frequency  content,  the  low-numbered  noises  being  rich 
in  lcw-frequency  sound  and  the  higher-numbered  noises 
rich  in  high-frequency  sound.  The  noises  numbered  from 
about  6  through  12  have  most  of  their  energy  in  the 
mid  frequency  region.  The  amplitude  ys  time  traces 
show  that  some  of  the  samples  are  relatively  constant 
in  level  while  others  vaiy  as  much  as  20  dB  in  level 
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Fig.  13.  Octave  band  and  20-cyclc  band  levels  of  noise  sample  13 
(air  grinder).  This  sound,  similar  to  a  “dental  drill”  sound,  was 
produced  by  an  air-driven  grinder  in  contact  with  a  metal  sheet. 
See  Fig.  1  for  other  details- 


every  15  sec.  This  ranking  system  was  not  used  as  the 
order  in  which  the  speech-interference  tests  were  run. 

III.  EQUALLY  SPEECH-INTERFERING 
LEVELS  OF  NOISE 

The  aim  of  tit's  portion  of  the  study  was  to  determine 
the  sj>eech-interfering  effects  ot  each  of  16  noise  samples 
by  means  of  the  procedure  known  as  the  speech-intel¬ 
ligibility  test.  In  such  a  test,  listeners  are  presented  a 
mixture  of  sp°ech  and  noise,  asked  to  write  down  the 
speech  the)-  hear,  and  scored  on  the  correctness  of  their 
responses.  In  this  study,  listeners  heard  constant  level 
speech  from  a  loudspeaker  in  the  presence  of  each  of  16 
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Fic.  14.  Octave  band  and  20-c>cle  b  md  levels  of  noise  sample  14 
(typewriter).  This  noise  is  strong  in  high-frequency  components 
and,  although  the  level  fluctuates  rapidly  from  moment  to  moment, 
the  over-all  level  is  fairly  constant.  Sec  Fig  1  for  other  details. 


different  ambient  noises.  The  level  cf  each  of  the  noises 
was  adjusted  to  give  equal  speech  interference,  i.e.,  50% 
of  the  words  heard  correctly.  The  levels  of  the  16  noises 
were  then  compared  to  find  tnc  measurement  procedure 
or  system  according  to  which  the  measured  levels  were 
most  nearly  equal. 


Procedure 

Test  words  were  recorded  using  a  Radio  Corporation 
of  America  type  BK6B  lavalier  microphone  in  order  to 
minimize  changes  in  recorded  voice  level  resulting  from 
movements  of  the  talker  with  respect  to  the  microphone. 
The  single  male  talker  used  had  been  found,  in  previous 
noise-masked  tests,  to  be  average  or  slightly  above 
average  in  intelligibility  when  compared  with  other 
laboratory  personnel.  To  simulate  the  performance  of  a 
talker  in  noise,  the  talker  spoke  at  a  level  approxi- 
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match'  S  dB  above  norma!  during  recording.  This  in¬ 
creased  vocal  output  was  achieved  by  having  the  talker 
first  deliver  words  with  normal  vocal  effort  without 
watching  a  VU  meter  in  the  recording  system.  The 
average  VU  meter  deflection  output  was  noted  and  8 
dB  of  attenuation  was  added  to  the  meter  circuit.  The 
talker  was  then  instructed  to  record  using  the  VU  meter 
and  to  adjust  his  vocal  output  to  the  level  which  pro¬ 
duced  the  same  meter  deflection  obtained  in  the  normal 
voice  condition.  Actual  levels  were  set  on  the  carrier 
word  “write,”  which  was  given  before  each  test  word. 

Rhyme6  words  were  recorded  at  a  rate  of  one  word 
every  two  seconds.  This  two-second  rate  is  faster  than 
that  used  by  Fairbanks  but  has  been  found  to  be  satis¬ 
factory  in  previous  tests.7  The  rhyme  test  is  essentially 
a  test  of  consonant  discrimination  with  the  listener 
required  to  fill  in  the  first  letter  of  an  incomplete 
monosyllabic  word. 

OCTtvc  Mt  tt«OS  **  C*C-.t3  Jtcoao 


Fig.  15.  Octave  band  and  20-cyclc  band  levels  of  noise  sample  15 
(thermal  noise  slojicd  at  plus  6  dB  [>er  octave).  This  is  a  high-fre- 
uenev  hiss  with  a  relatively  constant  level.  See  I-'ig.  1  for  other 
etails. 

One  hundred  lists  of  50  rhyme  words  each  were  re¬ 
corded  over  a  4-day  period.  After  elimination  of  all  lists 
with  errors  or  ambiguous  pronunciations,  five  blocks  of 
10  lists  each  were  assembled,  with  each  block  having 
equal  representation  from  each  of  the  four  days  of 
recording. 

For  the  intelligibility  tests  the  iisfs  were  played  buck 


*G.  Fairbanks,  “Test  of  Phonemic  Differentiation;  The  Rhyme 
Test,1'  J.  Acoust.  See.  Am.  30,  596  (1958). 

7  J.  C.  Webster  and  R.  G.  Klumpp,  “USXEL  Flight  Dtxk 
Communications  System.  Part  2.  Noise  ar.d  Acoustic  Aspects,’’ 
Ij.  S.  Xaw  Ehxtronics  Laboratory  Report  923,  AD-260  286 
(1960),  footnote,  p.  19. 


MT«  R»  (Utl  «  CtCUl  I1CM 


Fig.  16.  Octave  band  and  20-cyde  band  levels  of  noise  sample  16 
(jet  idle).  This  recording  of  a  jet  engine  on  liic  flight  deck  of  a 
carrier  has  strong  whine  components  and  has  several  am  upt  drops 
in  level. 

over  a  University  MLC  loudspeaker  mounted  on  an 
adjustable  stand.  The  frequency  response  of  this  load- 
speaker,  measured  3  ft  from  the  face  of  the  speaker 
(head  position  of  the  listener),  is  shown  in  Fig.  17. 
Playback  level  of  the  speech  at  the  listener’s  position 
as  measured  on  a  sound-level  meter  set  to  “C”  scale, 
“fast,”  was  approximately  78  dB. 

Ambient  noise  signals  were  provided  by  the  playback 
system  described  earlier  in  See.  II,  and  the  physical 
measurements  of  the  noises  (Figs,  t— 161  were  made  with 
the  microphone  at  a  position  corresponding  to  the 


Fig.  1).  Frequency  response,  in  third-octave  bands,  of  loud¬ 
speaker  ored  to  reproduce  rlivme  words.  (Add  4.9  dB  to  obtain 
octavc-band  level.)  Gppsr  curve:  Electrical  response  ot  thermal 
noise.  SILC  curve:  Acoustical  output  of  MLC  speaker  to  thermal- 
noise  input.  Bottom  Difference  between  electrical  input  and 
acousucai  output. 
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center  of  the  listener’s  head.  The  noise  measurements, 
therefore,  represent  some  sort  of  an  “average”  noise  at 
the  listener's  head  position. 

The  listener  was  seated  at  right  angles  to  and  midway 
between  the  two  speaker  arrays  located  18  ft  apart. 
Although  the  noise  sources  were  located  to  each  side 
of  the  listener,  noise  was  loca'ized  as  coming  from  above 
(equal,  in-phase  signals  at  each  ear).  The  loudspeaker 
emitting  the  word  lists  was  located  in  front  of  the 
listener,  at  ear-level  height,  and  with  the  face  of  the 
speaker  approximately  36  in.  from  the  ears  of  the 
listener.  The  listener’s  chair  was  fixed  in  position  and, 
because  of  the  proximity  of  the  small  table  on  which  he 
wrote  his  answers,  he  could  not  move  more  than  an 
in«:p  or  two  in  aiy*  direction. 

The  intelligibility  tests  in  this  experiment  differ  from 
tin:  LvUil  eirphone  or  single  loudspeaker  tests  in  that 
the  speech  and  noise  sources  were  spatially  separated. 
Th's  p.eans  that  the  listener  could  “selectively  attend” 
tc  ;  -•  speech.  Precisely  how  this  might  affect  the  results 
is  -.oi.  cei  tain.  The  listener  could  move  his  head  over  a 
restricted  area  and  hence  could  position  his  head  with 
respect  to  the  speech  source  in  front  of  him  and  the 
noise  sources  at  both  sides.  In  this  manner  he  could 
maximize  speech-to-noise  ratio.  In  particular,  this  pro¬ 
cedure  might  be  effective  against  high-frequency-noise, 
standing-wave  patterns,  such  as  existed  in  the  jet  idle 
noise  (#16),  which  contained  several  strong,  high- 
frequency  tonal  components. 

All  listeners  were  college  students  with  normal 
hearing  (no  loss  greater  than  10  dB  in  either  ear  from 
125  to  0000  cps  as  measured  on  a  Jjekcsy-iype  audiome¬ 
ter).  Five  males  and  three  females  in  the  17-21-year  age 
bracket  served. 

The  testing  sequence  for  each  listener  was  as  follows: 
(1)  audiometric  test,  (2)  practice  in  quiet  (10  word 
lists),  (3)  piactice  in  noise  (10  lists  in  each  of  4  levels 
of  unweighted  random  noise),  (4)  main  tests  in  which 
10  lists  were  presented  in  each  of  the  16  noises,  and 
(5)  a  retest  with  the  same  block  of  10  lists  under  the 
same  conditions  as  in  the  latter  portion  of  the  “practice 
in  noise”  sequence  to  assess  the  effects  of  learning. 

Listeners  were  tested  individually.  During  the  main 
tests,  a  single  session  for  a  giver,  listener  consisted  of  the 
presentation  of  10  lists  in  a  noise,  a  15-minute  rest,  and 
then  the  presentation  of  10  more  lists  in  a  second  noise. 
Each  listener  served  in  two  sessions  per  day,  one  in  the 
moitiing  and  one  in  the  afternoon,  until  the  required  12 
test  sessions  were  completed. 

Blocks  of  10  lists  were  counterbalanced  among 
listeners  and  nc-iscs,  and  the  sequence  of  r-oise  samples 
was  counterbalanced  among  listeners  and  afternoon 
and  morning  sessions. 

Results 

Table  I  lists  percentage  of  words  correct  obtained 
from  each  of  the  1.6  noises.  Each  percentage  :s  based  cn 


Table  I.  Percent  words  correct  vs  over-ai!  noise  level. 


Noise 

Percent 

Correct 

C  Scale- 
reading 
in  dB 

Correction 
in  dB 

Equally 
Interfer¬ 
ing  **C” 
Level  in 
dB 

1  Ship  Rumble 

40.5 

107.2 

-1.9 

105.3 

2  Grab  Engine 

52.5 

97.1 

0.5 

97.6 

3  Blower 

50.1 

92.3 

0.0 

92.3 

4  TX-6 

42.6 

883 

-1.5 

86.8 

5  Blower  &  Hull 

52.2 

84.9 

0.4 

85.3 

6  Shear 

54.5 

77.9 

0.9 

78.8 

7  Generator 

34.7 

82.3 

-3.0 

79.3 

8  Compressor 

49.3 

81.2 

0.2 

81.0 

9  Babble 

51.9 

79.9 

0.4 

803 

10  TN  Fiat 

53.1 

80.1 

0.6 

80.7 

11  Arresting  Gear 

44.8 

86.3 

-1.0 

853 

12  Engine  Room 

55.7 

85.1 

1.2 

863 

13  Air  Grinder 

48.0 

81.7 

-0.4 

S4.3 

14  Typewriter 

52.7 

85.9 

0.5 

86,1 

15  TN+6 

52.8 

87.7 

0.6 

883 

1 6  Jet  Idle 

51.4 

93.5 

0.3 

93.8 

Average 

49.2 

4000  words  (8  listenersX‘10  listsXSO  words  per  list). 
Column  4  lists  the  sound-pressure  level  of  each  noise 
(C  scale  of  sound-levei  meter)  as  it  was  set  during  the 
intelligibility  tests.  Based  on  the  assumption  that  a 
change  of  1  dB  in  speech-to-noise  ratio  would  produce  a 
change  of  5 %  in  the  percentage  of  words  heard  correctly, 
column  5  lists  the  amount  in  dB  to  be  added  to  column  4 
to  produce  a  score  of  50%.3  The  sum  of  columns  4  and 
5,  column  6,  yields  equally  speech-interfering  levels  of 
these  noises  expressed  in  terms  of  the  C  weighting  net¬ 
work  of  the  sound- ievei  meter. 

Au  average  score  of  99.8%  was  obtained  for  the  500 
words  presented  to  each  listener  in  the  quiet  (6  listeners 
heard  every  word  correctly,  one  person  missed  1  word 
in  500,  and  one  missed  8  words  in  500). 

Pre-  and  post-test  scores  for  the  group  of  8  listeners 
were  65.7%  arid  66.8%,  respectively.  The  difference 
1.1%  was  not  statistically  significant.  A  difference  equal 
to  or  greater  than  4.7%  would  be  required  to  be  signifi¬ 
cant  at  the  5%  level  of  confidence. 

An  analysis  of  variance  was  performed  on  the  data 
and  it  was  found  that  differences  among  subjects, 
between  scores  obtained  in  the  morning  and  scores  ob¬ 
tained  in  the  afternoon,  between  scores  obtained  for  a 
noise  when  it  was  presented  first  in  a  session  and  scores 


*  (a)  C.V. Hudgins, J.  E.  Hawkins, J.  E.  Karlin, and  S.S. Stevens, 
“The  Development  of  Recorded  Auditory  Tests  for  Measuring 
Hearing  l.oss  for  Speech,”  Laryngoscope  57, 57  (1947).  They  show 
a  4%  pc’-  dB  change  in  monosyllabic  scores  in  the  vicinity  of  50%. 
Fairbanks  (Ref.  6}  states  the  slope  to  be  3%  per  dB  over  an  ex¬ 
tended  (33%  to  *1%)  range,  but  his  data  when  restricted  doscr 
to  the  neighborhood  of  50%  show  more  like  a  5%  ]>cr  dB  slo;>c; 
fb)  VV.  E.  Montague,  “A  Com|?ari50ii  of  Five  Intelligibility  Tests 
for  Voice  Communication  System,”  XEL  Report  977,  PBI57  229, 
AD254-545  (I960).  He  used  the  rhyme  words  at  the  rate  used  in 
this  study,  and  gels  a  slope  of  about  7%  per  dB.  Had  either  7% 
or  4%  been  used  instead  of  5%,  the  difference  in  correction  would 
have  been  less  than  1  dB  for  the  noise  deviating  most  fro.n  50%: 
namely,  noise  $  7,  generator. 
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obtained  for  a  noise  when  it  was  presented  second  in  a 
session,  and  among  the  variability  of  intelligibility 
scores  obtained  with  each  cf  the  16  noises  were  not 
statistically  significant. 

IV.  PHYSICAL  PROPERTIES  OF  EQUALLY 
SPEECH-INTERFERING  NOISES 

It  is  apparent  from  column  6  of  Table  I  that  the  “C” 
level  of  a  sound-level  meter  does  not  predict  the  speech- 
interfering  properties  of  noise.  These  noises  were  ad¬ 
justed  in  level  to  give  equal  scores  on  a  speech-intel¬ 
ligibility  test,  but  the  levels  as  measured  vary  from  78.8 
to  105.3  dB(C)  with  a  standard  deviation  of  6.9  dB. 

It  is  neither  new  nor  surprising  to  find  that  the  dB(C) 
level  of  a  noise  does  not  predict  its  speech-interference 
properties.  Many  schemes  have  been  worked  out  that 
take  into  account  the  two  prime  determiners  of  speech 
intelligibility  in  noise:  namely,  the  spectrum  of  the  noise 
and  the  speech,  and  the  difference  in  level  between  the 
speeds  and  the  noise.  The  most  sophisticated  of  these 
sdtemes,  the  articulation  index  (AI),  is  not  discussed 
in  this  pape«-,  but  is  discussed  in  the  following  paper.® 
Here,  measurements  and  calculations  are  confined  to 
simple  measures  on  the  noiser  themselves:  the  noises 
that  have  been  equated  in  level  to  be  equally  speech- 
interfering. 

Three  methods  of  measurement  are  discussed:  the 
reading  of  a  sound-level  meter  with  weighting  net¬ 
works,  tite  fitting  of  plotted  noise  spectra  to  families  of 
noise  criteria  (NC)  curves,  and  the  calculation  of 
speech  interference  level  (SIL).  Good  summary  state¬ 
ments  of  SIL  with  the  limitations  and  assumptions  de¬ 
tailed  are  given  by  Rosenblith  and  Stevens**  and 
Beranek.11  The  general  idea  is  that  if  the  level  of  noise 
in  each  of  three, c  four,13  or  five11  octave  bands  is  known 
speech  levels  needed  for  adequate  communication  at 
specified  distances  can  be  predicted.  Noises  with  dif¬ 
ferent  spectra  and  levels  that  yield  the  same  SIL  would 
be  equally  speech-interfering.  That  is,  the  level  of 
speech  needed  to  be  understood  in  a  given  SIL  would  be 
the  same  regardless  of  the  combination  of  spectra  and 
levels  that  produced  the  SIL. 

For  an  office  space  to  be  acceptable  it  mast  be  quiet 
enough  to  allow  easy  face-tc-face  or  telephone  voice 
communications.  A  set  of  noise  criteria  (NC)  curves 


•J.  C.  Webster  and  R.  G.  Xiurr.pp,  “Articulation  Index  and 
Average  Curve-Fitting  Methods  of  Ivedicting  Speech  Inter¬ 
ference,”  J.  Acoust.  Soc.  Am.  35.  1338  (1553). 

19  \V.  A.  Rosenblith  and  X.  N.  Stevens.  Handbook  vf  Acoustic. 
Moist  Control.  Vol.  2:  lid re  and  Man.  Wright  Air  Development 
Center  Tech.  Kept.  52-204  (1953). 

u  L.  L.  Beranek,  Acoustics  (McGraw-Hill  Book  Company,  Jnc., 
New  York,  1954),  pp.  415-420. 

°  L.  L.  Beranek.  “Airplane  Quieting  II  -  Sped  Sea  (ion  of  Accept¬ 
able  Noise  Levels,"  Trans.  Am.  Soc.  Mech.  Eng.  59, 07  (5947). 

51 M.  Suasbcrg.  “Criteria  for  Setting  Airborne  Noise  I-cvd 
Limits  in  Shipboard  Spaces,"  Bureau  of  Ships  Resort  No. 
371-N-12  (1962). 

M  J.  M.  Pickett  and  K.  D.  Kryter,  “Prediction  of  Speech 
Intelligibility  in  Noise.’’  Air  Force  Cambridge  Research  Center, 
Tech.  Rept-  55-4  (1955). 


which  would  predict  office  acceptability  would  indiiectly 
predict  adequate  voice  communications.  Beranek15  has 
developed  such  a  set  of  NC  curves  having  numbers  as¬ 
signed  which  are  the  average  of  levels  in  the  600-1200-, 
1200-2400-,  and  2400-4800-cps  octave  bands.  Any  noise 
that  does  not  exceed  a  level  in  any  octave  band  greater 
than  a  specified  NC  curve  level  is  assigned  an  SIL 
number  corresponding  to  that  NC  curve.  If  the  spec¬ 
trum  of  a  noise  is  known,  its  SIL  can  be  estimated  from 
the  NC  curves.  This  fitting  of  a  noise  spectrum  to  a  set 
of  noise  criteria  contours  will  be  called  the  “tangent- 
to-curve”  method,  and  constitutes  a  second  way  of 
estimating  the  speech-interfering  property  of  a  noise 
from  knowledge  of  its  physical  properties. 

Young16  has  pointed  out  . .  that  the  NC  curves  are 
similar  in  shape  to  the  inverse  of  the  A -weighting.”  The 
manner  of  use  of  the  NC  curve  makes  it  comparable  to 
frequency  weighting  having  a  shape  inverse  to  the  NC ; 
therefore  A  levels  should  correlate  closely  with  NC,  at 
least  for  sounds  having  tangenry  to  the  NC  curve  over 
a  limited  frequency  band.  The  latter  qualification  is  re¬ 
quired  because  the  sound-level  meter  integrates  power 
over  die  entire  spectrum  after  weighting,  whereas  the 
tangent-to-curve  technique  responds  only  to  the  maxi¬ 
mum  tangency  level,  whether  it  be  a  single  (joint  or 
parallels  the  entire  NC  curv  \  Considering  the  response 
of  the  car  to  masking  and  loudness,  one  would  expect 
the  integrating  measure  to  be  uperior  to  the  non¬ 
integrating  one. 

Since  all  the  16  noises  were  adjusted  in  level  to 
produce  equal  speech  inteliigibiir  v  in  part  III  of  this 
experiment,  the  effectiveness  of  my  of  the  prediction 
methods  (SIL,  NC,  A  sound  lcve ,  etc.)  can  be  assessed 
by  its  ability  to  yield  an  equal  lumber  on  each  of  the 
16  noises.  The  standard  deviation  around  the  mean  of 
the  16  measures  on  SIL,  NC,  A,  etc.,  is  thus  an  inverse 
measure  of  which  scheme  acfislly  predicts  speech  in¬ 
telligibility  the  best. 

Table  II  shows  the  various  measures  associated  with 
the  16  equated  noises.  Columns  1  and  2  identify  the 
noises.  Column  3,  beaded  “weight,”  lists  an  estimate 
made  by  three  individuals,  experienced  in  noise  meas¬ 
urements,  of  the  relative  fr.-quency  of  occurrence  of  each 
type  of  noise.  That  is,  in  a  general  sampling  of  ship 
noises,  noises  1,  3,  and  5  would  compose  about  a  third 
of  all  noise  samples  (11/33);  noises  7,  8,  9,  and  14 
another  third ;  and  the  remaining  nine  noises  would  be 
typical  of  the  remaining  third  of  noise  types.  Columns  4. 
3, 6,  and  7  show  the  over-all  levels  as  measured  on  the 
C.  B.  A,  and  DIN  3  (Re.  17)  weighting  networks  of  a 
sound-level  meter. 


11 L.  L.  Beranek,  “Revised  Criteria  for  Noise  ir.  Buildings,’ 
Noise  Control,  3.  No.  5,  19-27  (1 957). 

s‘  R.  W.  Young,  “Don’t  Forgrt  the  Simple  Sound-Level  Meter,” 
Noise  Control.  4,  No.  3.  42-41'  (195S). 

!I  DIN  5045,  April  1962,  A.  Peterson  nr.d  1*.  V.  Bred, 
“Instruments  for  Noise  Mersurcnwnts,"  in  Handbook  oj  .Vc:if 
Control.  edited  by  C.  M.  Hrrris  (McGraw-Hill  Bock  Com; any, 
Inc.,  New  York,  1957),  Chap.  :6,  p.  16-13,  Fig.  16.12(b). 
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Table  II.  Physical  measures  or  calculations  in  dB. 


Xok  Weight 

Krwjomc  y  "Bright icx 

C  B  A  DIX3 

NO 

Ta2fcai*t<xaT\T 

XCA  ISO  ISO(R) 

600-4  $00 
CpJ 

SIL  calculations 

3  band 

300-8500  300-2400  5 JO-2000 
cps  Cits  Cps 

4  band 
250-2000 
cps 

1  Ship  RtisoUe 

3 

1053 

963 

£63 

91 

so 

SS 

77 

663 

703 

73.0 

71.6 

763 

2  Grab 

1 

93.6 

943 

S6.3 

*03 

S5 

81 

$7 

so 

64.4 

69.0 

71.7 

71. i 

763 

1  Bfc'-— i 

* 

92-3 

903 

85.1 

s;.i 

85 

79 

S3 

S2 

67.0 

71.7 

75.4 

73.7 

773 

4  T\  a 

• 

S6.S 

833 

793 

773 

75 

73 

74 

74 

693 

71." 

73.7 

73.1 

75.1 

*  3L,v--  f.  IJ  jfl 

5 

S3.S 

823 

783 

77.! 

75 

75 

75 

75 

65.6 

71.4 

73.2 

723 

73.7 

6  'Vjr 

1 

783 

78.6 

753 

743 

69 

69 

71 

71 

68.9 

70.0 

69  J! 

683 

70.0 

<  'v/t  jvT  JT 

-» 

793 

793 

78.6 

77.9 

72 

72 

74 

74 

69.9 

703 

7ai 

69.6 

70.0 

\ 

3 

$3.0 

SO.6 

783 

78.1 

72 

73 

75 

74 

71.4 

72.1 

72.4 

72.0 

72.6 

9  KihJrlf 

5 

803 

Sl-3 

£03 

£03 

76 

76 

S  t 

77 

733 

7 3.2 

74.4 

743 

723 

10  T  N  Flit 

i 

*0.7 

823 

£1.6 

si  .e 

75 

7S 

SO 

so 

74.6 

73.4 

723 

72.6 

71.4 

11  Anejtipg  Cnr 

1 

v«3 

£23 

78 

78 

79 

TO 

75.2 

765 

78.1 

773 

77  9 

•  I  ‘--tiiimr  Koon* 

2 

Si.  2 

£63 

£43 

S23 

SO 

7S 

£1 

81 

74.2 

76.7 

78.9 

77.4 

7 $2 

>3  Av-  Grinder 

• 

653 

84.3 

853 

S3 

84 

SS 

82 

75.7 

76.0 

74.2 

753 

\0 

M  TrirsTitti 

3 

SisA 

87.9 

S7.4 

87.9 

85 

85 

87 

S3 

78.0 

763 

74.1 

743 

7*3 

15  TN-f* 

2 

SS3 

90.: 

S9.5 

893 

85 

S3 

57 

87 

so.o 

77.0 

74_S 

75.9 

72.7 

:c  Jet 

1 

933 

95.1 

943 

94-S 

93 

94 

95 

90 

81.0 

793 

76.1 

79.9 

77.4 

Karat 

263 

183 

’S3 

2O.0 

24.0 

75.0 

24.0 

19.0 

16. 6 

103 

9.7 

103 

7.9 

Mean  (16) 

S7.0 

£63 

Sj3 

82.0 

£03 

75.7 

81.1 

79.1 

72.4 

73.0 

73.9 

73.7 

742 

Sran.  Dee.  Co) 

7.4 

53 

4.7 

5.0 

6.5 

S3 

6.4 

4.5 

43 

3.1 

23 

23 

2.7 

Stan.  Dee.  (33) 

6.7 

5.4 

43 

43 

63 

4.7 

5.9 

43 

43 

2.6 

2.1 

2.4 

2.4 

Mean  (33) 

S73 

£5.9 

£23 

81.0 

793 

77.6 

803 

753 

71.4 

•32 > 

733 

73.4 

74.2 

Rani  1  36 

13 

10 

5 

s 

11 

9 

12 

63 

63 

4 

1 

3 

2 

Order  J  33 

13 

10 

5 

7 

11 

9 

22 

7 

7 

4 

i 

23 

23 

Gronp  Rank 

3 

2 

3 

2 

1 

i 


Columns  8  and  9  result  from  overlaying  the  noise 
spectra  ,cig5.  1  to  16)  onto  families  of  NC  and  XCA 
(noise  criteria  alternate)  contours.  Since  there  are  no 
XC  or  XCA  contours  above  70,  ratings  are  extrapolated 
on  the  assumption  that  contours  above  70  would  be 
drawn  parallel  to  the  70  contours. 

For  columns  10  3nd  11,  the  noise  spectra  were  over¬ 
laid  ento  a  whole  family  of  ISO  contours.1*  The  contour 
(usually  interpolated)  just  tangent  to  the  spectral 
peaks  was  noted  in  column  10.  In  column  11  are  the 
results  when  only  the  range  between  500  and  2000  cps 
was  used;  this  is  the  restricted  range  ISO  application 
and  is  labeled  the  ISO(R)  method. 

Column  12  shows  the  simple  arithmetic  average  of 
the  sound-pressure  levels  (measured  in  dB)  in  the 
octaves  600-1200,  1200-2400,  and  2400-4SQO  cps.  This 
is  the  common  SIL  calculation.1*  Column  13  shows  the 
Strasbergu  SIL  calculation  which  is  the  arithmetic 
average  of  the  dB  levels  in  the  octaves  300-600, 
600-1200,  1200-2400,  and  24GO-4SOO  cps.  Column  14 
is  an  average  of  the  sound-pressure  levels  in  three 
octaves  from  300  to  2400  cps. 

Columns  15  and  16  are  averages  of  sound-pressure 
levels  m  octaves  centered  at  the  "‘preferred  frequen- 
des"H  of  5$),  1000,  and  2000  cps  (col.  15,  3  band),  or 
250,  500,  1000,  and  2000  cps  (col.  16,  4  band). 

Below  each  column  is  listed  the  range  of  measure¬ 
ments  (the  difference  between  the  largest  and  smallest 
number),  the  mean  measure,  and  the  standard  deviation 
of  the  measurements.  Two  means  and  standard  devia- 

!J  International  Standards  Oryinirati-m  Technical  Commilire- 
45  {3etrflari3J-159;235,  Draft  Secresariat  Proposal  for  Noise 
Ratine  Numbers  ait h  respect  to  Conservation  of  H.arinc. 
Speech  Communication  and  Annoyance  {Aug.  1961  >. 

‘‘American  Standard  ior  Preferred  Frequencies  for  Acoustscaf 
Measurement.”  Sl.6-1%0  fAmertcan  Standards  Association.  X«-c 
York). 


tions  are  shown,  one  based  on  the  16  measures  as  listed 
and  a  second  based  on  the  weights  (which  total  33) 
given  in  column  3.  That  is.  the  mean  (33)  and  standard 
deviation  (33)  are  based  on  3  noises  like  #  1,  one  iike 
#2,  3  like  #3,  etc.  The  ideal  calculation  scheme  or 
measurement  system  would  show  a  zero  for  both  range 
and  standard  deviation.  The  most  consistent  system  is 
the  one  with  the  smallest  standard  deviation. 

The  measurement  or  calculation  schemes  can  be  ar¬ 
ranged  in  rank  order  of  excellence  by  assigning  the 
rank  1  to  the  smallest  standard-deviation  measure  and 
13  to  the  highest.  This  is  done  on  Table  II  at  the  bottom, 
but  the  ranks  must  be  interpreted  with  caution.  Each 
standard-deviation  measure  has  its  own  error  of  meas¬ 
urement  [the  standard  error  of  a  standard  deviation  is 
(2X)-i  times  the  standard  deviation).  If  this  is  taken 
into  account,  the  measurement  schemes  can  be  rank- 
ordered  into  more  meaningful  groups  as  follows:  (1)  all 
SILJs  except  600-4890  cps;  (2)  SIL  (600-4800  cps), 
ISO(R),  A,  DIX3;  (3a)  B,  XCA,  and  (3b)  ISO,  XC, 
and  C 

From  this  grouping  it  is  apparent  that  all  SIL 
methods,  except  the  conventional  600-4800-cps  one, 
predict  the  speech-interfering  properties  of  noise  best. 
The  A-  or  DIX  3-weighted  sound-level  measurements, 
the  conventional  (600-4800  cps)  SIL,  and  the  ISO(R) 
curve  are  next  best,  and  the  remaining  tangent-to-curve 
methods  and  the  B-  and  C-weighted  sound  levels  are  the 
worst. 

These  results  can  he  better  visualized  in  Fig.  18, 
which  is  a  plot  of  the  data  in  Table  II.  In  Fig.  18  it  will 
be  noted  that  the  frequency  weighting  networks  give 
higher  readings  than  the  SIL  measurements.  This  is 
because  (1)  the  whole  spectrum  is  measured  and  (2)  the 
levels  combine  as  the  squares  of  the  sound  pressures. 
In  the  SIL  measurements,  sound-pressure  levels  above 
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SPEECH-INTERFERING  NAVY  NOISES 


F;g.  18.  Plot  of  da ta  in  Tabk  II. 
Measured  or  calculated  levels  in  <1B 
for  16  noises  equated  in  level  tc  be 
equally  speedi-interfering. 


and  below  a  certain  cutoff  point  are  ignored  and  the 
octave  band  levels  are  arithmetically  averaged:  they 
are  not  summed. 

Inspection  of  Fig.  18  shows  that  the  noises  group 
themselves  into  roughly  four  dusters.  The  first  live 
noises  are  characterised  by  high  C  readings  followed  in 
order  by  B,  A,  and  DIN  3.  These  are  noises  with  con¬ 
siderable  low-frequency  energy.  This  is  verified  by 
noting  that  the  SIL  over  the  300-24 90-cps  octaves 
exceeds  the  30O-4S00-cps  SIL.  and  both  exceed  the 
«XMS00-cps  SIL. 

The  next  group  of  five  noises  is  characterized  by 
nearly  equivalent  readings  among  the  C,  B,  A,  and 
DIN  3  network,and  among  the  three  SIL-typc averages. 

The  arresting-gear  and  engine-room  noises  are  mal¬ 
contents.  The  relative  levels  among  C,  B,  A,  and 
DIN  3,  and  among  the  SIL’s  are  like  the  low-frequency 
noises.  But  the  magnitudes  of  these  same  measures  lie 
midway  between  the  flat  spectrum  noises  and  the  re¬ 
maining  four  high-frequency  noises.  One  of  these  noises, 
the  arresting  gear,  is  very  intermittent  and  is  hard  to 
specify  and  to  measure.  The  other,  Lubckc’s  engine 
room  noise,  is  peaked  in  the  mid-frequences  and  is 
neither  a  high-  nor  low-frequcncy  noise,  nor  a  flat  one. 

The  last  four  noises  are  characterized  by  approxi¬ 
mately  equal  readings  on  networks  A,  B,  and  DIN  3, 
and  all  of  them  greater  than  C,  and  by  the  300-2400-q>s 


SIL  being  less  than  the  30(MS00-cps  SIL,  and  both  less 
than  the  60G-4S00-£D5  SIL.  These  are  predominantly 
high-frequency  noises. 

Three  base  methods  of  specifying  the  physical  char¬ 
acteristics  of  these  16  equally  speech-interfering  noises, 
have  been  shown  in  Table  II  and  Fig.  20.  These  are 
(1)  the  frequency-weighting  network  method  (A,  B,  C 
and  DIN  3)  as  used  on  sound-level  meters;  (2)  the  SIL 
calculation  methods  [arithmetic  average  of  sound- 
pressure  levels  (measured  in  dB)  in  contiguous  ooave 
bandsj;  and  (3)  the  tangent-to-standard-curve  method 
(adjust  peaks  of  plotted  spectra  to  rets  of  curves).  Of 
these  three  methods,  the  SIL  measurements  show  the 
lowest  values  of  dispersion.  The  A-  and  DIN-3-wdght- 
ing  networks  are  the  best  of  the  meter  network  methods. 
The  best  langent-to-curve  methods  are  those  utilizing 
a  restricted  range  or  a  steeply  riang  low-frequency 
contour. 

These  three  methods  w  ork  in  different  ways  and  it  is 
pertinent  to  point  out  how  they  differ.  The  simplest  in 
concept,  but  the  worst  in  predictive  ability,  is  the 
tangent- to-curve  method.  In  this  method  only  the  noire 
peak  that  first  becomes  tangent  to  the  generalized 
curvc(s)  determines  the  measure.  Any  pure  tone  com¬ 
ponent,  or  any  restricted  band  component,  that  differs 
drastically  from  its  surround  specifics  this  rating. 

The  frequency-weighting  networks  add  components 
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op  a  power  basis;  Le.,  two  equal  components  add  3  dB 
to  the  total,  and  a  single  component  10  dB  greater  than 
its  neighbors  essentially  determines  the  level.  Both  of 
these  methods  (tangent-to-curve  and  frequencv- 
weighting  network)  are  very  susceptible  to  tonal  or 
narrow -band  high  encrgv  components.  And  it  can  be 
inferred  from  Licklider  ano  Guttnsan58  that  tonal  com 
ponents  do  not  mask  speech  very  effectively. 

Unlike  the  tangent  and  network  methods— which  are 
determined  by  one  (tangent)  or  mom  (network)  energy 
peaks  and  give  readings  eqtal  to  (tangent),  or  g.cn»er 
than  (network),  tne  highest  peak — the  SIL  ptcthod 
lowers  are  importance  of  a  peak  by  averaging  in  lower 
levels. 

The  fact  that  speech-intelligibility  prediction  is 
better  when  the  300-600-cps  octave  is  included  in  the 
SIL  calculation  is  not  a  new  finding.  Beranek11  states 
on  page  419,  **_  _  .  if  the  level  in  the  300  to  600  <ps 
hand  is  not  more  than  10  dB  above  that  in  the  600  to 
1200  cps  band,  use  the  600  to  1200  cps  hand  as  the  first 
band  and  then  define  the  speech-interference  IcveJ  cs  the 
arithmetic  average  of  the  sound  pressure  levels  in  the  three 
oands  600  to  1200,  1200  to  2403,  and  2400  to  4SQ0  cps. 
However,  if  the  levels  in  the  300  to  600  cps  band  are 
mare  than  10  dB  above  those  in  the  600  to  1200  cps 
band,  the  average  of  the  levels  in  the  four  bands 
between  300  and  4-SGG  cps  should  be  used  instead.” 
Ship  noises  tend  to  predominate  in  low-frequency  sound 
so  it  is  not  surprising  that  Strasbergn  standardized  os 
a  30O-4Ktt-cps  SIL  band  when  treating  ship  noises. 
The  present  data  tend  to  support  the  view  that  the  top 
octave  2400-4800  cps  can  fee  diminated  from  the  SIL 
calculations  without  undue  loss  in  predictive  ability.  In 
fact,  for  these  ship  sounds  the  240O-4S0O-cps  octave 
just  adds  additional  disperson  in  the  measurements. 

SUMMARY  AND  CONCLUSIONS 

Recordings  of  ship  noises,  office  noises,  and  labo¬ 
ratory  (shaped  thermal)  noises  were  assembled  to  make 
up  a  sample  of  16  diverse  noises.  These  noises  were  ad- 


—  J,  C.  R  LickHdrr  asni  X.  Grtussa,  -Maskin'  of  Spetci  by 
Luw  Sjieciruni  Interference.”  J.  AcoaS.  Soc.  Ass.  29.  2S7-296 
{19r><).  Data  on  Fig.  11  (jc  293 j  sbo*  that  for  -iced  snktus- 
*>oq  score;  soar  <nuseods  wodd  need  be  15  dB  xsarc  iatssse  than 
random  noise.  It  shock!  foScrr  tiers  that  toe  ce  two  sscasoe-L 
10  dB  above  a  random  ssssc  baclxrrerrxi  ■stack!  not  farther  de¬ 
crease  word  articulation  screes 


justed  in  over-ail  levels  so  that  listeners  bearing  mono¬ 
syllabic  words  at  a  constant  level  via  a  loudspeaker 
achieved  Si/'l  word-intelligibility  scores.  Insofar  as  it 
is  possible  to  generalize  from  measurements  based  on 
16  noise  samples,  physical  measurements  and  calcula¬ 
tions  on  equally  speech-interfering  noises  show  that  the 
speech-interfering  properties  of  noises  is  best  estimated 
by  averaging  the  sound-pressure  levels  in  mid-frequency 
octaves,  then  by  use  of  frequency-weighting  networks 
in  sound  level  meters,  and  finally  bv  fitting  the  spectral 
peaks  to  noise  contours  More  specifically :  the  best  of 
the  SIL  methods,  the  300-2409-cps  SIL  calculation, 
gave  slightly  better  predictions  than  any  other  com¬ 
bination  of  octaves.  In  line  with  standardizing  on  “pre¬ 
ferred  frequency"  octaves  it  is  encouraging  that  the 
3-hand  (500-2DCO  cps)  SIL  and  the  4-band  (256-2000 
cos)  SIL  gave  results  similar  to  those  obtained  with  the 
500-2400-cps  SIL. 

Ol  the  next  best  method  (weighting  networks),  the 
A  aid  DIX  3  were  equivalent  and  as  good  as  the 
600-4 SOQ-cps  SIL,  all  of  which  were  belter  than  the  B 
and  C  networks.  As  a  first  approiimaiion,  and  in  the 
absence  of  octave  band  filters,  A  or  DIX  3  sound-level 
meter  measures  may  be  used  to  predict  the  speech 
interference  of  steady-slate  noises. 

Among  the  generally  poorest  methods  (noise  contour 
criteria),  the  ISO(R)  was  as  good  as  the  A  or  DIX  5~ 
weighling  network,  or  the  60C*-4SOO-cps  SIL,  and 
although  the  XCA  curves  gave  better  prediction  than 
the  ISO  contours  and  the  XC  carves  the  tangeat-to- 
curve  methods  as  presently  used  are  not  very  good,  no 
better  than  the  B- weigh  ting  network. 
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meaHste  s&rt  as  the  «acd-kid  meter,  ibf  VU 
(n&at  einll)  meter,  and  tbe  lasnm-Sobs  vo5t- 
ssrtr?  display  sirrad  kid  in  1»«k  cf  ihe  padtion 
of  a  painter  against  a  calibrated  scale.  Tbe  kid 
k  read  by  an  cokjt<t  or  aisnuBs  and  s  n®*5hr 
recorded  as  a  single  camber  representing  tbe  ar- 
*T*a  painter  position  daring  she  sampling  period. 

In  rantiae  wkefeid  zaeasaremeats.  it  is  not 
nuiawm  for  t  •  o53sen«s.  bcflfa  reading  tbe  same 
meter,  to  crane  up  with  t*o  different  narsbers  So 
represent  tbe  aissse  krd.  If  iba  ook  being 
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siwtsanal  k  araiivrii  <raii»  in  Ind  «*r  flortc*l<s 
RmUiiT  mitbia  a  limited  range  of  kieK  tbere 
s  asalhr  little-  sskastrctmeai.  liwwtrt-.  if  tbe  wm 
doetontes  martedh  and  irregaiaxly  m  kni  tbe 
job  of  estimating  tbe  average  kid  k  not  «-«•, 
jctesfaJs  an  sd  made  shb  etsisdrwr,  and  dk- 
jgswafBi  among  eJtsrrrrts  s  Hk-'iy. 

Tass  problem  is  sceaetsmes  bandied  by  aoeisnnr 
"be  eSiaatkss  made  by  several  (bwiers.  bnt. 
often,  to  sas?  time.  one  person  w  appointed  as  tbe 
oSsd  meter  reader,  and  fcis  estiaatiss  ate  ac¬ 
cepted  as  tbe  etrrrrt  nlar.  The  inaccuracy  =tati 
may  be  introduced  by  :hk  latter  proordcre  is  ordi¬ 
narily  ignored  in  msticr  mdse  sotasETessrsis  and 
ss  sssaUy  assmsed  to  be  less  than  tbe  natccaracy 
inherent  ia  tbe  sstassiiig  rnsnsaraation  or  sam¬ 
pling  ptwwBies. 

In  sme  of  oar  wk  woaerted  *rilh  tbe  sneaanre- 
meat  of  tbe  speeeh  imerfersrite  of  noise.  a  number 
of  b»c  kids  aero  to  w  measaned  *iso  srend-kid 
meters  *&S  anrea-tele  vc&wtas  It  *K  not 
practical  to  «-«*  a  gnap  of  obsentrs,  sad  it,  tnere- 
foTC.  became  desbahSe  to  assess  tbe  bias  ricdi 
might  l*e  isintioecd  by  tbe  me  of  a  single  met-**- 
reader.  Accordingly  sn  apniaKit  *w  coodncted 
to  determine  tie  ckpesmoes  among  observers  eti- 
raati&g  tbe  avenge  level  of  stapks  of  steady  stale 
and  ncvtsaiisg  sokes.  Each  tf  nine  observers  est:- 
salej  tbe  average  level  of  each  of  16  «i.n  event  noise 
ssapfa  diphjvfi  oas  ;brv—  dttfevrot  jsnfiar-iws! 

meters. 

Pncriat 

risB  >er.o-w  a  saakft  of  artrts  «a  band  in  tbe 
laboratory.  three  meters  ^tre  tassm.  pnmsrily  sn 
the  bask  of  a  dirTerw*re  in  damping  chararierisrirs. 
Tbe  three  meters  ■srketed  wm:  *11  a  Iltsktt- 
i^sdard  Ccapiay.  model  -KGA  irarrsktor  nst 
meter;  i-i  a  fienral  S*i»  Ccafaar.  type  lii!-A 
vcmfirivl  tnHir.  rci  “3;  *  s**J  ^7*,T- 

■amid  tfWtt  k.-.ksi  iiiV  iniMoHie  rirBic'  1.  - 
awl  3  are  iJx7<irn;A>  *»f  tbe  three  (orfer*.  s»l 
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Table  I  lists  stren]  danflaiflia  of  ibt  meters 
*s  they  were  employed  in  the  experiment. 

It  is  apparent  frees  Figs.  1-3  and  Table  I  that 
the  sorters  differed  ia  a  number  of  resptass.  To 
the  observer  reading  the  meters  ia  this  <  t-periment. 
however,  the  primary  difference  amour  the  meters 
was  the  degree  of  damping.  The  Ileuiett-Paeiard 
(HP)  meter  set  for  “1  eps-1  Me”  (  cycles  per  sk- 
oe»d  aad  megacycles  per  second)  response  almost 
tea^fldr  “iimed  oat"’  masceatery  fsciezlkss 
in  Jetd:  the  Brad  and  Kjier  'B  £  K)  meter  set 
to  “Pai-VU"  readily  responded  to  mass  momen¬ 
tary  f actuations;  the  Genera]  Radio  (GR)  meter 
set  to  “A  ■areiritrng-SlGJr  Meter”  was  intermedi¬ 
ate  aasci^  the  three  ss«ns  ia  the  rapidity  of  it* 
response.  Differences  in  damping  amass  the  three 
meters  are  shewn  in  Fie.  4,  which  p5»0  tbr  prr 
cent  of  pointer  angular  (bdertiaa  z<  a  fccliea  of 
time  when  cad!  kjs  activated  by  the  abrupt  onvr* 
of  a  sccJaned  IrtXIcps  signal.  Firarr  4  dwas  that, 
as  the  estns  were-  adjusted,  the  time  to  reach  TO*, 
cf  fnlkttlc  aagnlar  deflection  was  less  than  0.2 
second  (see)  for  the  B  £  K  meinr,  a  boat  03  s#c- 
«ad  for  the  GR  meter,  aad  over  23  sttceds  for  the 
HP  meter.  The  tanes  shown  were  obtained  o«ng 
sm  observer  with  a  stop  ektk  to  liac  pointer  ansn- 
lar  deSeetiwss.  aad  are,  therefore,  zpproxnsatsuav 

Table  II  presents  information  on  the  15  »&i*>- 
stsapjesv.  cstd  in  the  experiment.  Seme  of  the  wic 
samples  were  rrrerded  aixeard  operatise  Xavy 
ships,  sow  were  recorded  in  six-ps  aad  <Otys  on 
land,  and  a  few  were  generated  in  the  laboratory. 
The  corse  samples  ha  Table  II  are  ordered  from  1 
w  :S  cc  the  basis  of  frequency  characteristics. 
Xoms  No.  1  through  Xa  5  eeatain  predominantly 
5w  fncjatBcies;  seises  Na  6  through  No.  12  have 
roast  of  ibei.-  ««gT  ia  the  mid-frearaency  region, 
asd  noises  No  13  through  No.  16  hsre  sSnor  high- 
frequency  components^ 

The  duration  of  each  saspie,  listed  »a  «tarns  -5 
of  Table  II.  was  tailored  to  be  jufaeient  for  as  ob¬ 
server  to  make  a  sasrie  judgment  on  a  meter.  X«se 
sas-nJes  which  varied  only  slightly  or  regularly  ia 
lend  were  presented  for  a  minimum  length  of  time 
*9  see).  Other  samples  which  varied  irregalsriy 
or  widely  ia  level  were  presented  for  periods  of 
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time  up  to  44  see.  A  5-see  interval  htween  samples 
permitted  observers  to  record  estimations  made  on 
the  meters. 

The  observed  variation  in  level  of  each  noise  sam¬ 
ple  differed  among  the  three  meters  because  of  dif- 
fere-ntes  in  carter  damping,  frequency  response  and 
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rwtificd  circuitry.  Figure  ■>  slums  noise  traces 
Wade  with  a  power-level  nnmifr  adjilsi-d  to  simu¬ 
late  the  ncs|>onso  «f  tin*  It  &  K  meter,  i.c..  \  !'  damp¬ 
ing.  peak  reading.  Tills  figure  shows  that  a  l*w  of 
the  noises  varied  relatively  little  in  level.  some 
varied  moderately,  and  a  few  varied  markedly  in 
levei.  Each  trace  represents  a  time  interval  of 
slightly  more  than  13  see.  so  that  one  complete 
cycle  of  the  longest  noise  sample  (11)  could  be 
displayed. 

The  meters  were  connected  to  the  output  of  a 
tape  reproducer  with  the  signal  fed  into  the  GK 
sound-level  meter  via  the  microphone  connector. 
The  recorded  level  of  each  of  the  noise  samples  was 
as  far  as  possible  adjust'd  <»  allow  the  lfi  samples 
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fig.  4.  Approximate  dciapiag  ctrrrdtnstia  •(  the  three 
•eten  ud. 


to  1m-  displayed  without  changing  meter  settings 
J iet ween  samples.  Exceptions  were  noises  1  and  2 
for  which  the  gain  of  the  GR  meter  was  increased 
l*\  20  and  10  dH  (decibels),  respectively,  to  com¬ 
pensate  for  the  de-emphasis  of  low  frequencies  in 
this  meter  set  to  "A”  weighting.  Xo  attempt  was 
made  to  make  tiie  three  meters  agree  on  any  single 
noise  or  set  of  noises.  The  meters  were,  in  fact,  ad¬ 
justed  to  read  differently.  s<  as  to  minimize  the 
jxrssibiiity  of  estimations  made  on  one  meter  in¬ 
fluencing  estimations  made  on  another  meter. 

To  enable  subjects  to  h  _-ar  the  noise  at  the  same 
time  they  were  reading  the  level  on  a  meter,  the 
noise  samples  were  reproduced  over  an  extended 
range  loudspeaker  system  at  a  comfortably  load 
level.  All  subjects  were  employees  of  the  Navy 
Electronics  laboratory,  ami  ail  had  considerable 
experience  in  reading  many  types  of  meters. 

Each  subject  served  in  three  test  sessions — one 
»u  each  of  three  days.  During  each  test  sesd-n  each 
subject  made  a-  estimation  of  the  average  levei  of 
each  of  ibe  noise  samples  on  each  of  the  three  me¬ 
ters  ( lb  :>oiscr  times  3  meters  equals  48  judgments/ 
session  1-  The  sequence  of  meter  presentation  versus 
subjects  was  counterbalanced  over  the  three  ses¬ 
sions. 

Preparatory  to  the  first  t-st  session,  the  noise 
samples  were  played  through  twice  to  acquaint  tlie 
subjects  with  the  noises,  meters,  and  test  proce- 
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Fig.  6.  Standard  deviation*  and  ranges  of  estimations 
mcde  by  9  observers  on  each  of  throe  days. 


noises  made  on  each  of  tir.'o  meters.  Kuril  liar  is 
based  on  27  judgments  (one  judgment  on  eaelt  of 
tlnn*  da\s  from  each  t.f  nine  subjects).  The  niiin 
hen.  ami  abbreviations  a-ross  the  bottom  of  the 
figure  identify  tlie  noise  sampler,  (see  Table  II). 
The  bars  for  each  noise  are  arranged  with  the 
most  highly  damped  metc-r  (IIP)  to  the  left,  the 
intermediately  damped  meter  (OR)  in  the  center, 
and  the  least  damped  meter  (B  &  K)  to  the  right. 

Figure  6  shows  that  for  14  of  the  1C  temples  (5 
and  21  excluded)  the  standard  deviation  of  estima¬ 
tion  is  less  than  0.3  dB  and  the  range  is  2  dft  or 
less. 

An  analysis  of  variance  was  made  on  the  data 
obtained  with  nil  1C  noises  and  also  with  the  data 
from  14  noises  (6  and  11  excluded).  Results  of  the 
two  analyses  at  the  1%  level  of  confidence  were  the 
same.  Although  ail  main  variables  (Noises,  Meters, 
Days,  and  Subjects)  were  significant  at  the  1% 
level,  an  examination  of  the  standard  deviations 
shown  in  Fig.  6  indicates  that  the  total  variability 
from  trials  and  subjects  is  in  actuality  very  small. 


dures.  In  each  test  session  the  subject  was  in¬ 
structed  to  write  down  for  each  noise  sample  a 
single  number,  representing  his  best  estimate  of 
the  average  level  of  the  sample.  Subjects  were  told 
to  observe  the  pointer  position  dr.ring  the  entire 
time  the  sample  was  being  presented  and  to  obtain 
estimations  by  using  an  “arithmetic-averaging” 
procedure.  Thus,  according  to  instructions,  if  the 
pointer  deflection  was  “2”  for  4  the  time  and 
“8”  for  4  the  time,  the  desired  single-number 
average  reading  would  be  “5.0.”  If  the  deflection 
was  “2”  for  10%  of  the  time  and  “8”  for  .00% 
of  the  time,  the  desired  reading  would  be  “7.4.” 

Tt  was  apparent  that  the  subjects  would  not  be 
able  to  perform  computations  like  those  illustrated 
in  the  limited  time  available  during  and  between 
noise  samples.  The  instructions  about  averaging 
were  given  to  insure  that  all  subjects  started  with 
the  same  orientation  and  in  the  hope  that,  although 
specific  instructions  could  not  be  followed,  the 
general  philosophy  would  be  adhered  to. 

Subjects  were  instructed  to  read  only  the  decibel 
scales  on  the  meters  and  to  make  estimations  to  the 
nearest  0.1  dB  wl.  ‘never  possible.  Only  numbers 
corresponding  to  those  on  the  meter  scales  were 
recorded,  since,  with  the  meters  deliberately  thrown 
out  of  calibration,  the  absolute  values  had  no  mean¬ 
ing.  Before  each  of  the  test  sessions,  the  stability 
of  the  playback  and  meter  system  was  cheeked. 
Changes  in  the  level  of  recorded  sine  waves  and 
constant  level  noise  signals  were  less  than  —  0.2  dB 
as  measured  with  a  single  observer  reading  a  pre¬ 
cision  voltmeter. 

Results 

Figure  6  shows,  in  histogram  form,  the  standard 
deviation  and  range  of  estimations  for  each  of  Ifi 


Discussion 

For  14  of  the  16  noises  the  range  of  estimations 
is  2  dB  or  less  and  the  standard  deviation  is  less 
than  0.7  dB.  Considering  that  several  of  the  noise 
samples  contain  wide  variations  in  level,  the  agree¬ 
ment  among  27  estimates  (nine  subjects  on  three 
different  days)  is  surprisingly  good. 

Although  the  three  meters  differed  in  a  host  of 
characteristics  which  might  be  expected  to  affect 
the  estimations  (damping,  frequency  response,  ar- 
tangement  of  scales,  size,  and  number  of  scale  di¬ 
visions,  etc.),  there  appears  to  be  no  clear-cut  ad¬ 
vantage  in  using  one  meter  in  preference  to  another 
except  for  noises  6  and  11.  For  these  two  noises  the 
heavily  damped  meter  (IIP)  produced  the  smallest 
dispersion  of  estimations. 

The  obtained  standard  deviations  and  ranges  on 
noises  6  and  11  were  two  tc  three  times  iarger  than 
for  the  other  14  noises.  Clearly  the  ability  of  the 
subjects  to  agree  on  ‘ime-level  estimations  is  lim¬ 
ited.  It  should  be  no.ed  that  strong  objections  were 
voiced  by  most  of  the  subjects  about  making  the 
required  single-number  estimation  on  noises  6  and 
11.  They  felt  that  these  two  noise  samples  were 
made  up  of  several  distinctly  different  components 
and  should,  therefore,  have  been  estimated  in  terms 
of  two  or  more  separate  levels.  Sample  6,  the  power 
shear,  consisted  of  a  number  of  short  sharp 
“bangs”  in  a  sustained,  low-frequency  background 
of  motor  noise  while  sample  11,  the  arresting  gear 
noise,  consisted  of  low-frequeney  motor  and  ship 
noise,  a  shriek  changing  in  amplitude  and  fre¬ 
quency,  and  a  single  short  duration  “bang.”  Sub¬ 
jects  felt  they  were  able  to  estimate  the  level  of 
each  component  in  these  noises  with  a  reasonable 
degree  of  accuracy  but  could  not  asses-,  accurately 
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cstimuU*  tile  average  <1  rise  level  by  averaging  the 
pointer  deflections  on  a  time-weighted  basis.  This 
procedure  was  used,  because  the  instructions  are 
relatively  easy  to  follow  and  the  pr-'»eedu?e  is,  in 
fact,  one  that  is  commonly  used  in  noise-level  read¬ 
ing.  It  should  be  recognized  that  such  an  averaging 
procedure  does  not  result  in  an  accurate  estimation 
of  the  true  sound-pressure  level,  especially  wnen 
the  levels  being  combined  differ  substantially. 

Measurements  of  the  level  of  a  noise  made  with 
a  moving -coil  meter  arc  subject  to  error  from  at 
least  three  sources:  (1)  the  meter,  itself;  (2)  the 
measurement  technique  -  and  (2)  variability  or  bias 
from  the  observer.  It  is  of  interest  to  speculate 
about  the  probable  magnitude  of  these  errors.  It 
is  difficult  to  estimate  the  error  attributable  to 
meters  because  of  differences  in  the  precision  and 
circuitry  of  meters  in  current  use  and  because  the 
error  in  a  moving  eoii  meter  is  dependent  upon 
the  characteristics  of  the  noise  being  measured.  If 
the  desired  quantity  is  the  true  rms  level,  it  is  un¬ 
likely  that  commercially  available  meters  in  ordi¬ 
nary  use  will  be  more  accurate  than  ±  0.5  dB  and 
may  well  be  from  1  to  5  dB  or  more  in  error  on 
certain  noises.1  Errors  attributable  to  measurement 


measurement  of  noise  in  elect  ideal  systems,  the  sam- 
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measurements  made  on  acoustic  signals,  an  opti¬ 
mum  sampling  procedure  cannot  always  be  clearly 
specified  or  followed,  and,  in  such  cases,  the  meas¬ 
urement  technique  error  can  easily  be  several  times 
the  minimum  error  attributable  to  tbc  measuring 
instrument. 

The  data  in  this  experiment  suggest  that  tbe 
error  contributed  by  the  meter  observer,  provided 
the  averaging  procedure  is  specified,  is  small  enough 
to  ignore  in  routine  noise  measurements. 

1.  Moving  c(iil  meter*  with  *pe»'isl  ratifying  circuitry  are 
available  for  measuring  the  rms  level  of  noise.  Provided 
the  noise  being  measured  does  not  have  an  extreme  peak 
to  rm*  ratio  or  i*  not  extremely  short  in  duration,  such 
meter*  are  accurate  to  within  perhaps  ±0.5  dB.  The 
conventional  voltmeter  and  VU  meter,  calibrated  in 
terms  of  the  rms  of  a  sine  wave  but  actually  responding 
to  the  average  value  of  signals,  may  deviate  markedly 
from  true  rms  when  used  to  measure  noise  signals.  Set 
It.  M.  Oliver.  Hewlett-Packard  J.  1*,  No.  ",  1-4 
(1901):  E.  K.  Gross,  General  Radio  Company  Experi¬ 
menter  32,  No.  1/,  3-9  (195S);  (*.  G.  Wahrman,  Bruel 
and  Kjacr  Tech.  Rev.  No.  3,  9-21  (1958);  Janies  J. 
Davidson,  "Average  vs.  RMS  meters  for  Measuring 
Noise,"  IRE  Tran*,  on  Audio,  No.  4  (1961)  and  F. 
E.  Tevinan  and  J.  SI.  Pettit,  Electronic  ilcaxnrcmcntx 
(McGraw-Hill  Book  Company,  Inc.,  New  York.  1952), 
2nd  ed..  Chap.  1. 


Table  II.  Description  of  the  16  noise  samples. 


Name 

Description 

Maximum 
change  in 
level  in  dB 
(from  Fig.  5) 

Duration  of 
sample 
in  sec 

1.  Ship  Rumble 

Primarily  very  low-frequency  ship  rumble 

6 

15 

2.  Grab  engine 

Electrical  motor  of  large  sixe  with  strong  tow-frequency 
hum-like  components 

4 

35 

3.  Blower 

Low-frequency  blower  noise  recorded  aboard  ship 

G 

9 

4.  Th-rmal  noise,  — GdB 
per  octave  slope 

Low  frequencies  predominant 

4 

10 

5.  Blower  and  hull 

Low-frequenrv  rumble  and  machine  noise 

0 

10 

(i.  Power  shear 

Hydraulic  pewtr,  shear  rutting  metal,  includes  clang  of 
sheet  metal  dropping  to  floor 

1!) 

*29 

7.  Generator 

Shipboard  generator  with  whine  and  low-frequrncv 
components 

4 

*26 

8.  Compressor 

Refrigerator  compressor  with  rhythmic  sound 

5 

15 

it.  Voice  babble 

5  pairs  oi  communicators  exchanging  monosyllabic  words 

21 

31 

10.  Thermal  noise,  fiat 

Wiae-band  hiss 

3 

9 

11.  Arresting  gear 

Noise  recorded  in  arresting  gear  room  of  aircraft  carrier 
has  ntmbh ,  shriek,  bang,  and  roar  components 

21 

44 

12.  Engine  room 

Complex  machinery  noise  wide  band 

1* 

1*2 

13.  Air  grinder 

Air-driven  grinder  on  metal  sheet ;  high  pitched  “dental 
drill'’  sound 

11 

*20 

14.  Typewriter 

Typewriter  operated  at  40-60  words  per  minute — strong 
in  high  frequency  components 

22 

28 

15.  Thermal  noise  4-C  dB 
per  octabc  slope 

High-frequency  hiss 

3 

9 

10.  Jet  idle 

Jet  engine  on  flight  deck  of  carrier — has  strong  whine 
components 

7 

20 
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Sixteen  equally  speech-interfering  noises  were  rated  by  using  articulation  index  (AI)  calculations  and  by 
using  noise  criteria  (XC)-type  contours  as  averaging,  not  peak-finding,  devices.  Articulation-index  scores 
based  on  5  or  6  weighted  octave-band  levels  were  equal  to  AI  scores  obtained  by  the  20-band  method  and 
predicted  very'  well  that  the  noises  were  equally  speech-interfering.  The  use  of  KC-type  curves  (from  500  to 
2000  cps)  to  find  an  average,  not  a  peak,  noise  level  also  gave  good  prediction.  Equal ’y  speech-interfering 
noises  were  not,  however,  equally  “loud,”  nor  equally  “noisy.”  There  was  evidence  to  show  that  the  fre¬ 
quency  that  divided  noise-masked  speech  into  two  equally  intelligible  frequency  regions  was  850  or  1009 
cps,  and  was  not  1700  or  1900  cps,  which  is  the  dividing  frequency  when  the  speech  in  quiet  is  progressively 
restricted  in  bandwidth  by  high-  or  low-passed  filtering. 


INTRODUCTION 

N  two  earlier  papers,11  methods  of  measuring  the 
speech-interference  properties  of  16  diverse  noises 
were  discussed.  The  general  procedure  was  that  16 
noises  were  adjusted  in  level  so  that  listeners  hearing 
monosyllabic  (rhyme)  words  at  a  constant  level  via  a 
loudspeaker  obtained  50%  word-intelligibility  scores. 
Then  various  physical  or  psychophysical  measurements 
were  made  on  the  16  noises  reproduced  at  the  equally 
speech-interfering  levels.  Many  measurements  and/ or 
calculations  were  not  reported  in  the  two  previous 
papers  because  (1)  only  limited  presentation  time  and 
space  were  available,1  and  (2)  only  simple  schemes  were 
under  study.2 

This  paper  will  deal  with  the  articulation  index  (AI), 
the  use  of  tangc-nt-to-curve  contours  as  an  averaging 
method,  and  will  discuss  the  common  spectral  character¬ 
istics  of  the  noises. 

I.  ARTICULATION  INDEX 

The  AI  involves  the  difference  in  levels  between  the 
speech  spectrum  and  the  various  noise  spectra  in  differ- 

1  R.  G.  Kiumpp  and  J.  C.  Webster,  "Predicting  Speech  Inter¬ 
ference  from  Physical  and  Psvchophysical  Measures  of  Ambient 
Noise,”  J.  Acoust.  Soc.  Am.  35,  1116(A)  (1963). 

1 R.  G.  Kiumpp  and  J.  C.  Webster,  “Physical  Measurements  of 
Equally  Speech-Interfering  Naw  Noises.”  J.  Acoust  Soc.  Am. 
35,  1328  (1963). 


ent  bandwidths.  This  S-N  difference  in  dB  can  range 
from  zero  to  30,  since  no  negative  values  are  allowed, 
and  all  values  greater  than  30  are  called  30.  S-N,  from 
200  to  6800  cps,  can  be  calculated  in  20  bands,  varying 
in  bandwidth;  or  in  third-octave  bancs;  or  in  octave 
bands.  Regardless  of  how  many  bands  are  used,  an 
average  S-N  in  dB  (between  0  and  30)  is  found  and 
divided  by  30  to  get  a  number  from  0  to  1  which  is 
called  the  articulation  index.  For  further  background 
information  on  AI,  see  Licklider  and  Miller*  for  history 
and  rationale,  Hawley  and  Krvter4  for  details  of  its 
use,  Licklider5  for  a  critical  review  of  the  assumptions 
involved,  and  Krvter*-7  for  recent  modifications,  re¬ 
finements,  and  revalidation. 

In  this  paper  it  will  be  desirable  to  compare  the  AI 
results  to  measures  from  Refs.  1  and  2  always  meas- 


*  J.  C.  R.  Licklider  and  G.  A.  Miller,  in  Handbook  of  Ezpenmenta 
Psychology,  edited  by  S.  S.  Stevens  (John  Wiley  &  Sons,  Inc.,  New 
Vork,  1951),  Chap  26,  pp.  1055-1058. 

*  M.  E.  Hawley  and  K.  D.  Krvter,  “Effects  of  Noise  on  Speech,” 
in  Handbook  of  Sense  Control,  edited  bv  C.  M.  Harris  (McGraw- 
Hill  Book  Companv,  Inc.,  New  York,  1957),  Chap.  9,  pp.  9-5 
to  9-11. 

*  J.  C.  R.  Licklider,  "Three  Auditor)-  Theories.”  in  Psychology: 
A  Study  of  a  Science.  Study  /,  Conceptual  and  Systematic.  Vd.  I : 
~  ••wry.  Perceptual  and  Physiological  Formulation,  edited  by 

Koch  (McGraw-Hill  Book  Comiianv.  Inc..  New  York,  1959). 

*K.  D.  Krvter,  "Methods  for  the  Calculation  and  Use  of  the 
Articulation  Index,”  J.  Acoust.  Soc.  Am.  34,  16S9-J697  (1962). 

* K.  D.  Krvter,  “Validation  of  the  Articulation  Index,”  J. 
Acoust.  Soc.  Ain.  34, 1698-1702  (1962). 
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FiC.  1.  Articulation  index  (AI)  calculations  for  16  equally  <peech-interfering  noises.  The  ordinate  is  the  AI  (right)  or  the  average  speech- 
tp-naise  difference  over  20  eoually  important  speech  bands  or  over  5  to  6  octaves  (left).  The  onfinaie  increases  in  a  positive  direction 
downward  so  the  AI  data  will  be  compatible  with  all  other  data  (SIL,  weighting  network,  etc.)  measured  in  this  paper  and  in  Ret.  ?. 
That  is,  the  higher  the  data  points,  the  noisier  the  noise  is  rated. 

Four  different  speech  levels  were  used  to  make  the  AI  -.-alculaticss,  the  actual  level  used  and  levels  at  --10,  +  10,  and  -f2C.  Chrcc 
circles  mean  that  S-N  levels  greater  than  30  or  Jess  than  zero  never  occurred.  AH  iii.es  are  AI  figured  on  the  20-band  method.  The 
shading  around  the  “actual  level”  line  indicates  the  difference  between  using  the  actual  or  sp-xafic  speech  spectrum  or  a  generallwd 
speech  spectra  both  on  the  tvoclave  we^hted  method  or  calculation  (from  Ref.  2).  The  data  points  around  the  +10  line  indicate  the 

5-octave  (A)  or  the  6- octave  ( . )  method  of  calculation  at  a  speech  level  of  +J0. 1  he  shaded  area  at  the  right  end  of  the  +10  line 

shows  the  difference  between  limiting  the  AI  speech-to-noise  ratios  to  between  0  and  30  (lower  boundary)  or  letting  the  speccfc-to-noise 
ratios  take  on  any  +  or  —  value  (open  circles  and  upper  edge  of  boundary). 


ured  in  dB,  so  the  AI  will  be  left  in  terms  of  average 

5- N.  Although  it  is  not  always  possible  to  compare 
AI's  directly  to  SIL’s,  it  is  possible  in  this  study  because 
the  speech  level  was  constant  for  all  16  noises,  the 
distance  between  loudspeaker  and  listener  was  fixed, 
and  all  listening  was  done  in  the  same  room.  Since  the 
speech  level  and  speech  spectrum  were  always  the  same, 
the  AI  for  each  noise  depends  on  the  noise  spectrum  and 
level  just  as  the  SIL  does. 

Figure  1  shows  the  results  of  AI  calculations  using  the 
16  noise  spectra  from  Figs.  1  through  16  ic  Ref.  2.  The 
speech  spectrum  used  for  all  but  one  calculation  is  the 
“general  speech  spectrum”  from  Ref.  6.  The  solid  and 
dashed  lines  show  AI’s  calculated  on  the  20-band 
method,  using  the  octave-band  spectral  data  from  Ref. 
2  as  the  basis  for  the  noise  spectra,  and  using  for  the 
speech  the  “general  speech  spectrum.”  The  heavy  solid 
line  between  0  and  10  dB  (or  0  and  0.33  AI)  is  the  AI 
calculation  based  on  the  “actual  level”  of  speech  used 
in  this  experiment.  The  lighter  lines  are  for  speech 
levels  ±10  dB  from  “actual,”  and  the  dashed  line  is 
for  an  assumed  speech  level  +20  dB  above  the  level 
actually  used. 

The  shaded  area  around  the  “actual  level”  line  shows 
what  the  AI  is  when  the  6-octave-band  method  is  used 
instead  of  the  20-band  method.  The  upper  edge  is  when 
the  “specific  speech  spectrum”  and  the  “acJual  level” 
as  used  in  this  experiment  are  used.  The  lower  boundary 
is  when  the  “general  speech  spectium”  is  used  with  the 

6- octave  method. 

The  triangles  and  the  dotted  line-  around  the  “+10” 


speech-level  line  are  for  the  5-octave  and  6-octave  cal¬ 
culation  schemes,  respectively. 

A  circle  on  any  of  the  lints  means  that  in  none  of  the 
20  bands  did  the  S-N  truncate  at  0  or  30  dB.  If  each 
line  had  nothing  but  these  circles,  the  lines  would  be 
exactly  10  dB  apart.  Because  of  these  truncations,  how¬ 
ever,  on  very  few  noises  are  the  lines  exactly  10  dB 
apart.  There  is  no  line  that  is  free  of  truncations  com¬ 
pletely.  The  “+10”  lint:  truncates  only  on  noises  13 
through  16,  and  for  these  noises  the  true  (truncated) 
AI  values  are  shown  as  tire  bottom  edge  of  the  shaded 
portion.  The  line  and  circles  for  noises  13  through  16  on 
the  “+10”  line  are  the  values  that  the  AI  would  have 
assumed  if  the  0-  to  30-dB  limitations  had  not  been 
adhered  tc. 

The  noises  in  this  experiment  were  adjusted  in  level 
to  limit  listeners’  scores  on  rhyme  words  to  50%.  The 
average  AI  for  a  50%  rhyme  score  is  5.4  dB  or  0.1S 
(averaged  over  the  16  noises  on  the  “actual  level”  line 
of  Fig.  1).  Montague®  shows  that  a  50%  rhyme  score  is 
equivalent  to  a  40%  P B  word  score,  and  Kryter®  finds 
that  a  40%  PB  score  is  predicted  by  an  AI  of  0.17.  At 
this  low  differential  level  of  speech  to  noise  (5.4  dB), 
many  bands  (of  the  20)  on  many  noises  (of  the  16)  show 
negative  S-N  differences.  These  negative  values  are 
called  zero,  and,  because  of  the  preponderance  of  these 
truncated  bands  at  the  zero  level,  the  “actual  level” 
line  is  quite  horizontal  in  appearance.  Actually,  on  only 


•  W.  E.  Montague,  “A  Comparison  of  Five  Intelligibility  Tests 
for  Voice  Communication  SvsTems.”  Naw  Electronics  I-abomton- 
Report  977.  PB  157-229.  AD  .15+545  (1969). 
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3  of  the  16  noises  (those  indicated  by  the  open  drdes) 
do  all  values  for  all  20  bands  fall  above  zero  (and  less 
than  30).  AH  other  noises  truncate  on  at  least  one  band. 

It  is  only  when  a  speech  level  10  dB  greater  than  that 
aciually  used  is  the  basis  of  the  calculation  that  trunca¬ 
tion  within  the  0-  to  30-dB  range  is  largely  avoided. 
When  a  speech  level  20  dB  greater  than  the  level 
actually  used  in  these  experiments  is  used  as  the  basis  of 
calculation,  truncations  occur  wher  S-N  values  of 
greater  than  30  must  be  called  30.  This  occurs  on  ali 
noises  except  noises  11  and  12. 

In  summaiy,  in  regard  to  speech  level:  There  is  no 
level  of  speech  (in  the  10  dB  steps  chosen  in  this  study) 
where  all  20  bands  in  all  16  noises  are  within  the  0-  to  30- 
dB  acceptable  range.  Using  the  “+10”  level  of  speech 
and  allowing  the  values  to  go  negative  (or  exceed  30), 
a  rating  of  the  noises  that  is  independent  of  the  0-  to  30- 
dB  range  results.  This  is  no  longer  a  true  AI  calculation, 
but  it  dees  give  a  speech-interference  measure  on  a  noise 
spectrum  by  utilizing  the  speech  spectrum.  The  true 
A I  takes  into  account  both  the  speech  spectrum  and  the 
0-  to  30-dB  S-N  range.  If  the  speech  level  is  progres¬ 
sively  decreased,  the  AI  rates  the  noises  progressively 
more  equal  in  speech  interference  until,  of  course,  all 
noises  mask  out  speech  completely,  at  which  point  all 
noises  are  indeed  equally  speech-interfering. 

None  of  the  calculations  shown  in  Fig.  1  include  any 
of  the  spread  of  masking  corrections  detailed  bvKrvter.* 
This  is  because  at  the  levels  of  noise  used  in  these  cal¬ 
culations  there  is  essentially  no  spread  of  masking.  For 
example,  on  noises  1  and  2  the  maximum  level  in  the 
octave  around  200  cps  is  90  dB.  When  this  is  converted 
to  spectrum  level,  it  is  reduced  to  67  dB  and  the  upward 
spread  of  masking  is  for  only  150  cps,  and  then  it  falls 
off  at  25  dB  per  octave  (Tabic  IV,  Ref.  6).  This  cor¬ 
rection  would  at  most  affect  only  the  lowest  of  the 
20  bands  and  then  by  less  than  5  dB.  The  total  correc¬ 
tion  in  the  20-band  average  would  then  be  a  quarter  of  a 
dB  which  is  not  distinguishable  on  plots  such  as  Fig.  I. 

However,  the  AI,  even  without  the  correction  for 
•upward  masking,  does  a  good  job  of  specifying  that 
these  16  noises,  adjusted  in  level  to  be  equally  speech- 
interfering,  are  indeed  equally  speech-interfering.  Al¬ 
though  no  table  of  scores  and  standard  de-nations  wiU 
be  included  in  this  paper,  as  was  done  in  Ref.  2,  the 
standard  deviation  of  the  AI  calculated  by  the  weighted 
octave-band-level  method  and  using  the  “general  speech 
spectrum”  was  2.4  based  on  the  16  noises  (or  2.1  using 
the  33  weighting  of  Ref.  2).  This  makes  the  AI  almost 
identical  to  the  best  rating  method,  the  300-2400-cps 
speech-interference  level  (SIL)  of  Ref.  2.  And  when  the 
20-band  AI  of  t»e  “specific  speech  spectrum”  was  used, 
the  AI  had  a  slightly  lower  standard  deviation;  namely, 
1.7. 

The  data  in  Fig.  1,  therefore,  tend  to  support  Kry- 
ter’s7  latest  evidence  in  that  they  “. . .  amply  demon¬ 
strate  the  general  validity  of  the  AI  calculated  by  the 
20-band  method.  .  .  ."The  data  also  tend  to  support  the 


Fig.  2.  Loudness  (in  phons)  and  noisiness  (in  PXDF '  of 
16  equally  speech-interfering  noises. 


condu.  ion  of  Kn-ter,  Flanagan,  ind  Williams*  that  “the 
octave  band  method  for  the  calculation  of  AI  can  be 
used  in  place  of  the  more  detailed  20-band  method 
without  any  appreciable  loss  in  the  accuracy  with 
which  speech  intelligibility  test  scores  are  predicted.” 
It  would  also  appear  that,  for  comparative  purposes,  a 
general  speech  spectrum  is  nearly  as  useful  as  the  specific 
speech  spectrum  in  calculating  the  AI. 

IL  “LOUDNESS**  AND  “NOISINESS” 

Are  noises  that  are  equally  speech-interfering  equaliy 
loud  or  equally  noisy?  The  literature  abounds  in 
methods  of  calculating  loudness.’0-11  and  a  concept  of 
perceived  noisiness,15  measured  in  novs  and  possibly 
related  to  annoyance,  has  recently  been  formulated.  The 
loudness  and  noisiness  of  these  16  equally  speech- 
interfering  noises  have  been  determined  by  use  of  the 
Mark  6  loudness  contours  of  Stevens,10  and  Kryter’s 
revised  noys  contours,1*  and  are  plotted  in  Fig.  2. 

The  curves  in  Fig.  2  show  that  equaUy  speech- 
interfering  noises  are  neither  equally  loud  nor  equally 
noisy.  In  fact,  the  relative  rankings  among  the  16  noises 
are  rated  by  loudness  or  noys  methods  almost  as  they 
were  in  Ref.  2  by  the  NCA  curves.1*  The  16  noises  arc, 
however,  noted  to  be  about  12  dB  higher  in  phons  and 
15  dB  higher  in  PNDB  than  by  the  XCA  contours. 

The  XCA  rating  is  determined  by  the  highest  peak- 
in  a  noise  spectrum  that  touches  an  NCA  contour.  Both 
the  phon  and  PXDB  calculations  give  maximum  weight 
to  the  highest  noise  peak  but  also  add  in  a  fraction  of 
all  lower  levels  all  along  the  frequency  spectrum.  It 
might  be  expected  therefore  that  the  loudness  or  noisi¬ 
ness  of  these  16  equally  speech-interfering  noises  would 
be  more  nearly  equal  than  the  NCA  ratings  because 

f  K.  D.  Krytcr,  G.  Flanagan,  and  C.  Williams,  “A  Test  if  the 
20  Band  and  Octave  Band  Methods  of  Computing  the  Articulation 
Index,”  Bolt  Bcranck  and  Newman  Inc.,  Contr.  <JSAF19(G04)- 
4061,  Rcpt.  ESP-TDR-62-4  (1961). 

“S.  S.  Stevens,  “Procedure  for  Calculating  Loudness:  Mark 
VI,”  J.  Acoust.  Soc.  Am.  33, 1577-1585  (1961). 

11  E.  Zwickcr,  “Ein  Vcrfahrcn  zur  Ecrcchnung  dcr  Lautstixkc,” 
Acustica  10,  304-308  (1960). 

3  K.  D.  Krytcr,  “The  Meaning  and  Measurement  of  Perceived 
Noise  Level.”  Noise  Control  6,  No.  5,  12-27  (1960). 

3K.  D.  Krytcr  (personal  communication);  also  shown  as  an 
appendix  (Fig.  A.1)  in  J.  T.  Broch.  “Loudness  Evaluation,” 
Brucl  &  Kjacr  Tech.  Rev.  No.  2  (1962). 

14  L.  L.  Bcianek,  “Revised  Criteria  for  Noise  in  Buildings.” 
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Fig.  3.  Plot  of  the  XCA-70.  ISO-70.  XC-70, 
10  novs  and  S  sone  contours. 


some  account  is  taken  of  noise  levels  other  than  the 
highest  peak.  And,  in  fact,  there  is  a  slight  reduction  in 
the  dispersion  of  scores:  namely,  from  a  standard  devia¬ 
tion  of  5.2  for  the  N'C A  rating  (Table  2,  Ref.  2)  to 
4.8  for  both  the  phon  and  PXDB  curves.  However,  this 
small  amount  of  leveling  does  not  appreciably  change 
the  shape.  It  still  must  be  stated  that  rating  noises  by 
undue  regard  to  the  highest  peak  that  touches  a  family 
of  noise-  (or  loudness- )  rating  contours  is  not  a  good 
method  of  describing  the  speech-interfering  properties 
of  said  noises. 

IH-  AVERAGE  CURVE-FITTING  TECHNIQUES 

In  Ref.  2  it  was  observed  that  fitting  the  peaks  of 
plotted  noise  spectra  to  families  of  NX,'*  XCA,M  and 
ISO'5  curves  was  not  a  good  way  to  rate  these  noises  as 
being  equally  speech-interfering.  And  in  Sec.  II  of  this 
paper,  it  was  observed  that  loudness10  or  noysa  cal¬ 
culations  were  not  much  better.  However,  it  has  been 
shown1-2  that  by  using  only  the  portion  of  the  (ISO) 
curves  that  were  in  the  speech  region,  say,  those 
octaves  centered  at  500,  1000,  and  2000  cps,  a  great 
improvement  resulted. 

It  should  also  be  noted  that  if  only  those  parts  of  the 
contours  centered  on  the  octaves  at  500, 1000,  and  2000 
cps  are  used,  the  families  of  contours  are  not  radically 
different.  Note,  for  example,  in  Fig.  3,  that  the  contours 

u  These  International  Standards  Organiratirn  curves  appear  in 
drafts  of  Tech.  Comm.  43  (sec  footnote  17  in  Ref.  2),  but  arc  also 
published  with  instructions  on  their  use  by  J.  If.  Janssen.  “Some 
Acoustical  Properties  .if  51iips  with  Respect  to  Xeisc  Control. 
Part  I,”  Report  Xo.  4tS  cf  Xethcrlands’  Research  Centre  T.N.O. 
for  Shipbuilding  a.v*  Xavigation,  Dclit,  The  Netherlands  (1962). 


of  the  NCA-70,  ISO-70,  NC-70,  the  10  sone,  and  the  10 
noys  ate  all  within  5  dB  of  each  other  through  this 
range.  And,  in  Let.  the  average  values  are  for  the  XCA, 
72;  ISO,  71 ;  NX.  72;  10-sone,  73;  and  10-noys,  70  dB. 
It  follows  then  that  any  of  the  above  families  of  con¬ 
tours  would  give  considerably  better  (lower  dispersion) 
ratings  on  the  16  equally  speech  interfering  noises  if 
restricted  to  the  5C0-,  1000-,  and  2000-cps  range,  and 
that  any  one  curve  would  be  nearly  as  good  as  any  other. 
In  Ref.  2  only  the  ISO  was  used  in  the  restricted  range, 
but  any  other  one  would  have  worked  as  well. 

However,  restricting  the  range  on  the  contours  while 
continuing  to  use  the  tangent-to-curve  method  does  not 
describe  the  speech-interfering  properties  of  the  16 
noises  as  well  as  the  500-,  1000-,  and  2000-cps  SLL  or  the 
“actual  level”  AI. 

It  is  possible  that  even  better  predictions  could  be 
made  if,  instead  of  fitting  the  peaks  of  r.oise  spectra  to 
families  of  curves,  some  sort  of  averaging  or  integration 
could  take  place.  For  example,  separate  readings  can  l>e 
made  of  where  the  ISO,  XCA,  or  noys  contours,  at  the 
points  500,  1000,  and  2000  cps,  become  tangent  to  the 
noise  spectra,  and  an  average  of  these  three  readings 
taken;  or  another  way  of  arriving  at  the  same  rating 
is  by  visual  averaging.  To  accomplish  this  the  curve, 
say,  the  ISO-70,  is  fitted  over  the  16  noise  spectra  such 
that  the  best  visual  average  is  obtained  (half  the  spec¬ 
trum  lies  above  ana  half  below  the  ISO-70  curve). 
When  this  average  fit  is  obtained,  the  point  where  the 
70-dB  ordinate  on  the  ISO  curve  intersects  the  ordinate 
on  the  noise  spectra  is  the  desired  rating.  This  is  roughly 
equivalent  to  making  an  SIL  calculation ;  the  difference 
is  that  the  contours  are  not  flat  through  these  three 
octaves,  it  is  therefore  not  surprising  to  note  in  Fig.  4 
that  the  results  of  averaging  the  ISO-70  contour  through 
the  16  noise  spectra  are  virtually  indistinguishable  from 
the  3-band  preferred-frequency  SIL  (500,  1000,  and 
2000  cps)  replotted  from  Ref.  2.  And  the  standard  de¬ 
viations  are  also  equal  at  2.8. 

It  would  appear  from  these  results  that  a  better  way 
to  use  N'C,  XCA,  or  ISO  contours  to  rate  the  speech- 
interfering  properties  of  noise  would  be  to  find  the 
lowest  noise-rating  curve  that  averages  out  the  variations 
in  the  noise  spectrum  at  500,  1000,  and  2000  cps  and 
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Ftc.  4.  Sjrt-sch  interference  £IS0(R)AVE3  l“scd  on  averaging 
the  muh'rcqucnci  region  (500.  1000,  anti  2000  cps)  of  the  ISO 
contours  through  the  16  equally  speech-interfering  nese  spectra. 
Plotted  irom  Ret.  2  for  comparison:  the  SIL  (5W-,  1000-,  and 
2000-qis  3-band  average)  as  open  circles,  and  as  triangles  the  AI 
(general  sjiccch  spectrum,  actual  level,  and  on  octave-band  l>asis) 
arbitrarily  set  equal  on  noise  Xo.  10  (thcrmil,  flat  noise). 
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not,  as  presently  used,  detennine  the  lowest  noise- 
rating  curve  just  not  exceeded  by  the  plotted  noise  spec¬ 
trum.  In  other  words,  average  out  any  noise  peaks; 
don't  let  the  highest  noise  peak  determine  the  rating. 

IV.  COMMON  SPECTRAL  CHARACTERISTICS 

Many  of  the  results  of  this  study  point  to  a  re-examin¬ 
ation  of  the  “most  important  speech  frequency.”  For 
example,  the  SIL  that  predicted  these  data  best  was 
the  300 -2400 -cps  average,1  or  the  3-band  preferred- 
frequency  octaves  centered  at  500,  1000,  and  2000  cps, 
and  not  the  more  conventional  600-4S00-cps  average. 
Likewise,  the  AI  weighted  the  low  frequencies  too  little 
and  the  high  frequencies  too  much.  And  when  tangent- 
to-curve  methods  or  average  curve-fitting  methods  were 
restricted  to  the  three  octaves  centered  around  1000  cps, 
they  predicted  much  better.  These  data  tend  to  imply 
that  the  lower-frenuencv  components  Gf  the  masking 
noises  are  more  important  than  other  investigators  have 
thought  them  to  be. 

For  example:  French  and  Steinberg,5*  using  speech 
in  the  quiet  and  normal  listeners,  progressively  high- 
and,  later,  low-pass  filtered  the  speech  until  it  became 
progressively  less  intelligible.  They  found  that  speech 
was  equally  deteriorated  when  all  frequencies  either 
above  or  below  1900  cps  were  filtered  out;  i.e.,  the 
frequency  range  above  1900  cps  was  as  important  as  the 
frequency  range  below  1900  cps.  Beranek,17  using  male 
voices  only,  found  the  crossover  frequency  to  be  1700 
under  the  same  quiet-filtered-spevch  conditions. 

However,  Pollack1*  redid  the  filtered-speech  intel¬ 
ligibility  studies,  but  added  a  broadband  noise  back¬ 
ground  and  varied  the  level  of  the  speech.  He  found 
that  the  crossover  (or  equal  importance)  frequency  in¬ 
creased  from  S00  cps  for  low  levels  of  speech  through 
1010,  1300,  1430,  to  1620  cps  for  increases  of  10  dB  in 
the  speech  level. 

Dyer55  did  the  reverse  of  Pollack :  namely,  he  filtered 
the  noise  around  a  broadband  speech  signal  and  found, 
like  Pollack,  that  as  the  speech-to-nolst  differential  in¬ 
creased  the  crossover  frequency  increased  from  about 
1000  cps  to  almost  2000  cps.  A  crossover  frequency 
can  aiso  be  found  from  Klumpp  and  Webster  data7 
and  is  shown  in  Fig.  5.  Tg  arrive  at  Fig.  5,  the  spectrum 
of  Fig.  10  (Ref.  2)  was  subtracted  from  the  spectra  of 
Figs.  1  to  9  and  11  to  15.  This  amounts  to  taking  out 
the  characteristics  of  the  playback  system  by  subtract¬ 
ing  the  flat  thermal  noise  spectrum  from  all  other 


14  X.  R.  French  and  J.  C.  Steinberg.  “Factors  Go-.-cminc  the 
Intd'igibiiitv  of  Sjwxch  Sours  W  J.  Acoust.  Soc.  Am.  19,  90-119 
(1947). 

L.  L.  Bcranck.  “Dcsicn  of  Soeech  Communication  Systems,” 
Proc.  IRE  35,  SS0-S90  (1947  . 

>*  I.  Pollack.  “Effects  of  Hich  Pass  ami  Low  Pass  Filtcrinc  on 
the  IntcHisiKlitv  of  Sfxcch  in  Noise.”  J.  Acoust  Soc.  Am.  20, 
259-266  (194$}.' 

n  W.  R.  Dvrr.  “The  Masking  of  Speech  bv  High-  and  Low-Pass 
Noise,"  Tech.  Document  Ref.t.  No.  RADC-TDK-62-29S.  Rome 
Air  Development  Center  (1962). 
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Fig.  5.  Ranges  (icpl  and  averages  (bottom)  of  4  types  of  noise 
spectra  all  adju.-ted  in  kvd  to  be  eqvailx  sjiecrh-interferins-  (Add 
3.1  dB  to  obtain  octaec-iond  level-) 

spectra.  This  is  roughly  equivalent  to  measuring  the 
electrical  voltage  across  the  transducer  instead  of  the 
acoustical  output.  This  procedure  in  no  way  changes 
any  of  the  interrelationships  among  the  noises;  it  only 
assigns  a  value  of  0  dl>  in  each  octave  to  the  flat  noisc 
spcctrum  and  plots  all  others  relative  to  this. 

The  noise  spectra  in  Fig.  5  are  plotted  in  groups:  at 
the  top  are  the  envelopes  of  spectra  of  noises  1  through 
5  and  12  (Figs.  1  through  5,  and  12  of  Ref.  2).  That  is,  a 
curve  is  drawn  through  the  highest  level  at  each  octave 
of  any  of  the  6  noises.  A  similar  curve  is  drawn  through 
the  lowest  level  assumed  by  any  of  these  noises.  The 
envelope  at  the  top  of  Fig.  5  encompasses  therefore  the 
total  range  assumed  by  any  of  the  low-frequency  noises 
between  the  frequency  limits  of  250  and  2099  cps. 

Similariy,  the  envelopes  of  noises  6.  7.  and  S  are 
plotted  and  show  a  much  smaller  dispersion.  The  plotted 
envelopes  of  noises  9, 10, 11, 13,  and  14  arc  slightly  up- 
sioped.  Noise  15  is  plotted  by  itself  since  it  is  the  only 
noise  which  is  predominantly  a  high-frequency  noise 
The  jet  noise  is  not  plotted  since  it  is  unduly  influenced 
by  the  single  frequency  components  at  3000  cps  and 
above. 

At  the  bottom  of  Fig.  5  is  the  average  spectrum  of  the 
three  types  of  noise  envelopes,  rogether  with  the  one 
predominantly  high-frequency  noise  spectra.  Observe  in 
Fig.  5  that  both  the  predominantly  high-  and  low- 
frequcocy  noise  spectra  and  the  slightly  high-  and 
low-frequency  noise  spectra  cross  cadi  othe;  at  about 
$50  cps. 

Kryt-.T7  also  has  some  data  of  speech  masked  by  noise 
which,  when  replotted,  show  a  low  crossover  frequency. 
For  example,  if,  instead  of  plotting  the  noise  spectra  of 
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Fig.  2  in  Pef.  7  to  be  equal  in  over-all  level  (measured 
by  the  C  or  flat  weighting  network  of  an  SLM),  it  is 
posable  to  plot  them  at  levels  which  are  equally  speech- 
interfering.  This  is  possible  by  utilizing  the  information 
in  Fig.  3  of  Ref.  7,  which  shows  the  speech-t>»-  (over-all 
level  of)  noise  ratio  for  given  word  scores  in  the  four 
noises  of  Fig.  2  (Ref.  7).  For  an  equal  word  so:re  of,  say, 
60%,  the  relative  levels  among  noises  can  be  found  bv 
assuming  a  given  level  of  speech  and  noting  that  the 
over-all  level  of  noise  A  is  26  dB  less  than  the  speech 
level,  noise  B  is  2  dB  less,  noise  C  b  21  dB  less,  and 
noise  D  is  7  dB  less.  To  show  the  levels  at  which  the 
noise  spectra  in  Fig.  2  of  Ref.  6  are  equally  speech- 
interfering  (at  60%),  the  noise  spectrum  can  be  re¬ 
plotted  such  that,  with  respect  to  noise  A,  noise  B  as 
reduced  in  over-all  level  by  24  dB,  C  by  5  dB,  and  D  by 
19  dB.  If  this  is  done,  the  spectrum  plots  of  the  equally 
speech-interfering  noises  intersect  each  other  between 
the  600-1200-cps  octave  and  the  1200-2409-cps  octave, 
or  at  roughly  1200  cps;  the  steeply  sloped  spectra  below 
this  at.  say,  900  cps; and  the  gentle'  sloped  ones  above 
at,  say,  1400  cps. 

It  appears  therefore  from  the  evidence  cf  Pollack,11 
Dyer,**  Kryter,7  and  this  study,  that  noise-masked 
speech  has  a  crossover  or  importance  frequency'  as  much 
as  an  octave  tower  than  the  crossover  frequencies  of 
filtered  speech  in  the  quiet  (French  and  Steinberg,1* 
and  Beranek17).  Both  Pollack!,and  Dyer3  show  that  the 
frequency  varies  from  800  or  1000  cps  to  1600  or  2900 
cps  as  the  speech  -  tomoise  differential  increases. 


SUMMARY 

This  paper  has  examined  ways  in  which  16  noises 
adjusted  in  level  tc  be  equally  speech-interfering  can 
be  rated  by  AI  and  average  curve-fitting  methods.  It 
is  a  continuation  of  a  companion  paper5  in  which  simpler 
physical  rating  or  measuring  schemes  were  evaluated. 

In  this  paper  it  was  found  that  AI  procedures  are 
good  but  oniy  marginally  better  than  the  5CO-,  10GC-, 
and  2000-cps  SIL  found  best  in  Ref.  2.  It  was  2lso  ob¬ 
served  that  the  5-  or  6-octave-band  procedures  are 
about  2s  good  as  the  20-band  method,  and  that  a 
generalized  speech  spectrum  was  almost  as  valid  as  the 
specific  speech  spectrum  for  comparative  evaluations 
among  the  Jo  noises. 

Tbe  16  equally  speech-interfering  noises  were  neither 
equally  lend  nor  equally  noisy.  But  their  speech -inter¬ 
ference  value  could  be  predicted  very  well  from  families 
of  NC,  NCA,  ISO  (loudness  or  noys)  curves  if  (1) 
only  that  part  of  any  of  the  curves  that  center  on  the 
octaves  at  500,  1(300,  and  2000  eps  is  used,  and  (2)  the 
curves  “average  through”  spectres  peaks  and  valleys. 
The  curve-fitting  techniques  do  not  work  well  if  only 
spectral  peaks  are  allowed  to  touch  them. 

Evidence  from  this  and  other  studies  shows  that  al¬ 
though  the  "importance  frequency”  for  filtered  speech 
in  quiet  is  around  1700  or  1900  cps  this  “importance 
frequency”  drops  as  much  as  an  octave  as  the  noise 
masking  increases,  or,  more  precisely,  as  the  5~X 
differential  decreases  from,  say,  30  to  5. 


PSYCHOPHYSICAL  MEASUREMENTS  OF 
EQUALLY  SPEECH-INTERFERING  NOISES 


In  Sections  III  and  V  of  this  report,  methods  of  pre¬ 
dicting  the  speech  interference  properties  of  16  diverse 
noises  were  discussed.  The  general  procedure  was  that 
13  noises  were  adjusted  in  levei  so  that  listeners  hearing 
monosyllabic  words  at  a  constant  level  via  a  loudspeaker 
obtained  59  percent  word  intelligibility  scores.  Then  var¬ 
ious  physical  measurements  were  made  on  the  16  noises 
reproduced  at  the  equally  speech-interfering  levels. 

The  results  showed  that,  measuring  the  noise  level 
in  octave  bands  and  averaging  those  bands,  the  Speech  Inter  - 
ference  Level  concept  was  the  best  method  if  simplicity  and 
accuracy  of  prediction  were  both  considered.  iVeigfcled 
6-octave  band  Articulation  Index  calculations  were  slightly 
better  in  accuracy  of  prediction. 

This  section  of  the  report  details  other  techniques 
of  measurement. 


Maskei  Threshold  Spectra 


In  Sections  in  and  V  no  physical  measurement  was 
presented  that  was  completely  satisfactory  in  specifying 
that  the  16  noises  were  equally  speech-interfering.  It  was 
believed  that  some  procedure  utilizing  the  properties  of  the 
human  ear  might  lead  to  better  results,  so  in  Section  VI 
some  measurements  are  given  that  were  obtained  by  utiliz¬ 
ing  the  masking  effects  of  the  noises. 

Three  independent  masked  audiograms  from  each 
of  two  experienced  listeners  were  found  for  each  of  the  16 
noises.  A  Pekesy  audiometer,  using  pulsed,  half-octave 
bands  of  nGise  as  the  probe  stimuli,  was  employed.  The 
balf-c-ctave  probe  stimuli  from  the  Bekesy  audiometer  and 
the  16  masking  noises  were  fed  into  a  single  Lansing  Iconic 
loudspeaker  situated  in  an  acoustically  treated  room  that 
measured  7  feet  by  10  feet  by  9  feet.  The  listener  sat  one 
meter  away  from  the  loudspeaker  and  for  each  of  the  IS 
noises  found,  by  means  of  an  attenuator  and  a  bracketing 
technique,  the  level  at  which  each  of  the  naif-octave  bands 
of  noise  centered  at  125,  180,  250,  360,  ...4000,  and  5700  c/ 
was  just  barely  audible. 

The  masking  experiments  reported  in  this  section 
were  conducted  a  year  after  the  speech  tests  described  in 
Section  HI,  IV,  and  V,  and  it  was  not  possible  to  reassem¬ 
ble  the  sime  electroacoustic  system  used  in  the  speech 
tests,  nor  to  test  in  the  same  room.  To  minimize  effects 
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due  to  the  characteristics  of  the  new  playback  system  and 
test  room,  all  masked-audiogram  spectra  3re  plotted  as 
duferences  from  the  flat,  thermal  noise  (10).  Figures 
VI- 1  to  VI- 16  (except  fig.  VI- 10)  shew  these  rrasked-audio- 
gram-derived  spectra.  Also  shown  in  figures  VI- 1  to  VI-9 
ant  VI- i  l  to  VI- 16  are  the  differences  from  noise  10  of 
third-octave  band  levels  measured  acoustically  with  the 
General  Ra-iio  Sound  and  Vibration  Analyzer,  Type  1554A. 
Likewise,  the  difference  in  octave  levels  of  all  noises  from 
the  noise  10  are  shown.  The  levels  on  which  the  octave- 
level  difference  are  based  are  taken  from  Section  III. 
Broadband  thresholds  (the  horizontal  bars)  are  also  plotted 
in  figures  VI- 1  to  VI-9  and  VI-11  to  VI-16,  but  these  will 
be  discussed  m  Section  VII , 
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HALF-OCTAVE  MASKED  AUDIOGRAM  DATA  {SHIP’S  RUMBLE 
MINUS  THERMAL  NOISE) 

DIFFERENCE  BETWEEN  SHIP'S  RUMBLE  AND  THERM' L  NOISE 
MEASURED  IN  OCTAVE  BANDS 
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Figure  VI~4.  Difference  from 
Thermal  Noise  cf  Noise  4, 
Thermal  Noise,  -6  dB  per  oc¬ 
tave  slope. 
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HALF-OCTAVE  MASKED  AUDIOGRAM  DATA  (SHIP'S  RUMBLE 
MINUS  MRMAL  NOISE) 

DIFFERENCE  BETWEEN  SHIP'S  RUMBLE  AND  THERMAL  NOISE 
MEASURED  IN  OCTAVE  BANDS 

DIFFERENCE  BETWEEN  SHIP'S  RUMBLE  AND  THERMAL  NOISE 
MEASURED  IN  THIRD  OCTAVE  BANDS 
DIFFERENCES  BETWEEN  MASKED  THRESHOLDS  OF  NOISE 
BANDS  FOP  SHIP'S  RUMBLE  AND  THERMAL  NOISE 


Figure  VI-7 .  Difference  from 
Thermal  Noise  of  Noise  7,  Ger.- 
erato  r. 
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Unlike  figures  VI- 1  to  VI-9  and  VI- 11  to  VI- 16, 
figure  VI- 10  is  the  actual  spectrum  of  noise  iO.  The  upper¬ 
most  curve  is  the  octave -band  spectrum.  It  is  taken  di¬ 
rectly  from  figure  10,  Section  III,  and  is  therefore  repre¬ 
sentative  of  the  level  and  the  acoustic  conditions  (including 
playback  system  and  room  acoustics)  of  the  eaual-speech- 
interference  part  of  these  studies.  The  next  curve  down  is 
the  third-octave  acoustic  spectrum  of  the  thermal  noise  as 
used  in  the  masked-audiogram  part  of  these  studies.  A 
different  room  from  the  previous  testing  room  (smaller  and 
with  shorter  reverberation  time),  different  playback  equip¬ 
ment  (Altec  Iconic  loudspeaker),  and  a  10-dB  lower  sound 
pressure  level  were  utilized.  The  10-dB  lower  level  was 
used  because  of  equipment  limitations.  The  masked  thresh¬ 
old  of  half-octave  bands  of  noise  heard  ;n  the  presence  of 
the  level  and  spectrum  of  thermal  noise  depicted  by  the 
third-octave  analysis  is  shown  by  the  solid  squares. 

These  three  spectra  -  the  octave,  third  octave,  and 
masked  threshold  levels  -  are  those  used  fox*  the  zero  or 
reference  lines  on  figures  VI-1  to  VI-9  and  VI-11  to  VI-16, 
that  is,  on  these  figures,  when  octave  level  differences  are 
plotted,  the  octave  level  shown  in  figure  VI- 10  is  the  common 
spectrum  or  zero  reference  line.  When  third-octave  dif¬ 
ference  spectra  sire  depicted  in  figures  VI- 1  to  VI-9  and 
VI-11  to  VI-16,  the  third  octave  spectrum  on  figure  VI  -10 
is  the  common  or  referral  spectrum.  When  half-octave 
masked-threshold  differences  are  plotted,  the  zero  refer¬ 
ence  is  the  masked  threshold  data  plotted  in  figure  VI- 10. 

The  reason  for  utilizing  the  difference  spectra  in 
figures  VI-1  to  VJ.-9  and  VI-11  to  VI-16  was  to  equate  out 
of  each  of  the  16  noises  the  amplitude  vs  frequency  response 
of  the  playback  system  (recorder,  loudspeaker,  and  room), 
and  thereby  facilitate  comparisons  with  the  earlier  studies. 

As  is  evident  in  figure  VI- 10,  the  playback  system  had  a 
generally  rising  characteristic  to  360  c/s,  a  dip  at  500  c/s, 
and  a  falling  characteristic  above  2500  c/s.  But  all  16 
noises  had  this  same  relative  response;  therefore  the  use 
of  difference  spectra  essentially  eliminates  this  as  a  source 
of  undue  complexity. 

The  bottom  of  figure  VI- 10  shows  the  spectrum 
level  of  the  thermal  noise,  based  on  the  third-octave  spec¬ 
trum  data.  The  difference  between  the  masked  threshold 
data  and  the  spectrum  level  data  increases  with  rise  in 
frequency,  i.e.,  from  20  to  30  dB  between  250  to  5000 
cycles.  This  difference,  according  to  Hawkins  and  Stevens, 16 
is  sometimes  taken  as  defining. .  .the  critical  bandwidth  of 
a  masking  noise.  "  They  were  concerned  with  pure  tones 
masked  by  w*hite  noise,  whereas  the  present  data  concern 
half-octave  bands  of  noise  masked  by  white  nc  ise. 

Greenwood,  in  an  extensive  study  of  critical  oanduddth, 
used  narrow  bands  of  noise  as  probes,  but  his  noises  were 


much  narrower  than  half-octaves  and  his  masker  was  pure 
tones,  not  broadband  noise.  Greenwood's  data  on  pure 
tones  agree,  however,  with  Hawkins  and  Stevens.  Perhaps 
the  similarity  (of  noise  in  noise)  vs  the  distinctness  (of 
tones  in  noise),  accounts  for  the  5-dB  difference  noted 
between  the  two  curv's  shown  at  the  bottom  of  figure  VI-10. 
However,  differences  in  mode  of  presentation  (earphone  vs 
loudspeaker),  choice  of  method  (adjustment  vs  Bekesy 
audiometer),  monaural  vs  binaural  listening,  or  between 
Hawkins  and  Scevens,  the  two  subjects,  and  those  of 
Greenwood,  could  contribute  to  the  difference. 
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Figure  VI-10 .  System  respons 
of  thermal  noise  into  loud¬ 
speaker  in  sound- t r eat ed ,  but 
non-anecho  ic  room.  This  is 
the  reference  noise  zo  which 
all  other  noises  (fig.  VI- 1 
to  9  and  11  to  16)  are  com¬ 
pared.  No  log  amplitude¬ 
time  flat  is  shown  but  the 
time  variation  is  just  slight 
ly  greater  then  the  trace 
shown  on  figure  VI-15.  The 
upper  three  black  curves  are 
physical  measures.  The  spec¬ 
trum  level  is  calculated  from 
the  third-octave  data. 
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Figure  Vl-11.  Difference  from 
Thermal  Noise  of  Noise  11, 
Arresting  Gear.  There  are  three 
masked  threshold  spectra,  one 
for  each  of  three  levels  of 
fluctuating  noise. 
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Figure  VI- 14 .  Diffe rsnce 
Thermal  Noise  of  Noise  14 
wriier. 
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Figures  VI- 1  to  VI- 16  show  that  in  general  the  half- 
ociave  masking  data  (heavy  solid  line)  are  closely  equivalent 
to  the  third-octave  filter  data  (grey  line).  There  is  not 
a  great  deal  of  upward  masking  for  the  noises  with  most  of 
the  energy  at  low  frequency  shown  in  figures  VI-1,  VI-2, 
VI-3,  and  VI-4.  That  is,  the  heavy  solid  line  in  these  four 
figures  does  not  lie  consistently  above  (nor  to  the  right  of) 
the  grey  line.  In  figures  VI-5,  VI-6,  and  VI-7,  there 
may  be  some  upward  masking.  Figure  VI- 1 5  shows  the 
reverse,  a  downward  shift.  The  octave  band  data  do  not 
always  agree  too  well  with  either  the  third-octave  physical 
data  or  the  half-octave,  masked-audiogram  data.  It  should 
be  pointed  out  that  the  octave  results  are  derived  from 
eariier  data  (Section  III)  where  a  different  audio  system  was 
used  for  reproducing  the  noises.  The  largest  discrepancies 
occur  on  the  most  fluctuating  noises.  Nos.  6,  9,  11,  and  14. 

Note  that  the  half-octave  masked  threshold  data 
often  level  out  the  peaked  spectra  information  shown  in  the 
data  derived  from  the  third-octave  filters  (see  Nos.  2,  6, 

9,  11,  13,  and  16).  This  is  because  the  third-octave 
filters  "see”  everything  within  their  passband,  and  peaks 
(tonal  components)  result  in  high  numerical  readings.  When 
the  listeners  find  the  threshold  for  a  half-octave  band  of 
noise  in  the  presence  of  these  same  peaks,  they  essentially 
hear  around  the  tonal  components  and  base  their  threshold 
on  parts  of  the  banded  noise  outside  or  beyond  the  peaked  or 
tonal  components. 


Broadband  Masked  Thresholds 

Also  shown  in  figures  VI- 1  to  VI- 16  (except  fig. 
VI-10)  are  broadband-noise  thresholds  for  eac^  noise.  Pour 
broadband  noises  are  used:  300  to  2400-,  300  to  4800-, 

600  to  4800-,  and  200  to  6800-c/s  bands.  As  on  all  plots 
in  figures  VI- 1  to  VT-9  and  VI- 11  to  VI- 16,  these  broad¬ 
band  thresholds  are  expressed  as  differences  from  the  re¬ 
spective  broadband  thresholds  found  for  the  thermal  noise. 
Although  not  plotted  in  figure  VI- 10,  the  broadband  masked 
thresholds  in  the  presence  of  72  dB  of  thermal  noise  were 
65  dB  for  the  300  to  2400-c/s  band;  67  dB  for  the  300  to 
4800-  or  600  to  48'J0-c/s  band;  and  68  dB  for  the  200  to 
6800  band. 

In  figures  VJ-l  to  Vl-9  and  VI- 1  i  to  VI-16  the 
broadband  thresholds,  expressed  as  differences,  are  indi¬ 
cated  by  horizontal  ines  that  define  the  bandwidth  of  the 


YJ-14 


band.  As  would  be  expected,  the  threshold  for  these  broad¬ 
bands  of  noise  in  the  presence  of  predominantly  low-fre¬ 
quency  noises  is  determined  by  the  high-frequency  cutoff 
of  the  probe  band  noise.  The  threshold  is  successively 
lower  as  the  high-frequency  cutoff  of  the  probe  noise  band 
is  successively  higher  (noises  and  figures  VI- 1  to  VI-6  and 
to  a  lesser  extent,  noises  and  figures  VI-9  and  VI- 12).  For 
the  flatter  spectrum  noises  (Nos.  7,  8,  11,  13,  and  14), 
the  broadband  thresholds  are  roughly  equivalent  regardless 
of  either  high-or  low-frequency  cutoff.  For  the  one  clear- 
cut,  high-frequency  noise  (No.  15),  the  noise  bands  with  the 
lowest  frequency  cutoffs  have  the  lowest  thresholds.  Note 
that  a  line  drawn  through  the  upper  cutoff  frequencies  of 
the  probe  noise  bands  is  roughly  parallel  to  the  masking 
noise  spectra  on  the  low-frequency  noises.  And  conversely, 
a  line  through  the  lower  cutoff  frequencies  of  the  pulse  noise 
bands  is  parallel  to  the  high-frequency  masking  noise  spec¬ 
trum  in  figure  VI- 15. 

The  question  is  posed  as  to  whether  the  masked 
thresholds  of  these  broadband  probe  noises,  which  were 
chosen  to  lie  in  the  important  speech  frequency  regions, 
will  give  a  measure  of  the  speech-interfering  properties 
of  the  16  masking  noises.  It  is  important  to  know  how  much 
variation  there  is  in  the  300  to  2400-c/s  band  threshold 
among  the  16  noises.  In  columns  1  and  2  of  table  VI-1  are 
listed  the  number  and  names  of  the  16  noises.  Column  3 
is  a  weight  that  reflects  an  estimate  of  how  often  noises 
of  this  type  would  occur  in  larger  samples  of  ship  and 
possibly  industrial  noises.  Column  4  lists  the  300  to 
2400-c/s  band  thresholds  as  read  directly  from  figures 
VI- 1  to  VI- 16  (No.  10  is  by  definition  zero  since  every 
noise  is  compared  to  it).  The  mean  and  standard  deviation 
of  these  16  numbers,  the  mean  and  standard  deviation  of  the 
33  numbers  represented  by  three  noises  like  No.  1,  one 
like  No.  2,  fi  ve  like  No.  5,  etc. ),  and  the  extent  of  the 
spread  between  the  highest  and  lowest  number  (the  range) 
are  listed  below  columns  4  through  12.  Columns  5,  6,  and 
7  list  the  othc-r  broadband-noise  thresholds  taken  from  fig¬ 
ures  VI- 1  to  VI- 16.  Column  8  lists  the  threshold  obtained 
using  a  voice  babble  as  the  probe  stimulus  to  assess  the 
masking  of  the  IS  noises.  These  voice  babble  data  are  not 
plotted  in  figures  VI- 1  to  VI- 16. 
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Psyehophysioa!  Speech  Interference  Levels 


In  Section  III  the  concept  of  the  SIL  was  explained 
and  calculations  using  a  variety  of  different  combinations 
of  octave -bands  were  tabulated  (table  II  Section  III).  To 
show  comparisons  between  the  masked  threshold  data  and 
-  some  of  the  better  methods  in  Section  III,  five  columns  of 

SIL  calculations  are  reproduced  from  Section  III  and  listed 
in  columns  9  through  13  of  table  VI- 1. 
t  In  columns  14  and  15  are  similar  SIL  scores  based 

on  the  spectra  of  figures  VI- 1  through  VI-16.  These  are 
called  psychophysical  SIL's  because  they  are  calculated 
from  spectra  arrived  at  by  the  masked  threshold  technique. 
Since  all  noise  spectra  in  figures  VI- 1  through  VI-9  and 
VI- 11  through  VI- 16  are  difference  spectra  (the  reference 
cr  zero  line  represents  the  spectrum  of  the  flat,  thermal 
noise),  the  SIL's  are  all  relative  to  an  SIL  of  zero  for  the 
flat,  thermal  noise),  the  SIL's  are  all  relative  to  an  SIL  of 
zero  for  the  flat,  thermal  noise  (No.  10).  Columns  14  and 
15  show  the  psychophysical  SIL's  for  both  the  three-  and 
four-band  set  of  octaves  based  o.  ‘he  "preferred"  center 
frequencies  of  250,  500,  1000,  and  2000  c/s. 

To  determine  which  of  columns  4  through  1 5  in  table 
VI- 1  has  the  smallest  dispersion  among  the  16  noises,  the 
standard  deviations  must  be  examined.  Column  13  has  the 
smallest  standard  deviation.  However,  if  the  standard  error 
of  the  standard  deviation  (1//2N)  is  considered,  columns 
9  through  14,  except  for  column  11,  must  be  adjudged 
equivalent.  Columns  4,  5,  and  6  could  hardly  be  excluded 
statistically,  and  it  could  be  considered  that  columns  9  to 
14  (except  11)  are  in  one  class,  columns  4  to  7  and  11  in 
another,  and  column  3  off  by  itself.  In  any  grouping  it  is 
evident  that  the  physical  SIL's  from  Section  HI  are  not 
statistically  different  from  the  psychophysical  SIL's  calcu¬ 
lated  from  the  data  in  this  section. 

Figure  VI- 17  summarizes  the  SIL  and  broadband 
threshold  data  tabulated  in  table  I  (Section  HI).  In  every 
case,  the  measures  have  been  set  equal  on  r~*ise  10.  From 
the  present  data,  and  those  in  Section  HI,  it  would  appear 
that  any  SIL  taking  into  account  the  octave  belmv  600  c/s 
would  predict  the  speech-interfering  properties  of  these 
*  noises  better  than  the  presently  used  600  to  4800-c/s  SJL. 

The  psychophysical  SIL's  are  good  predictors,  but  not 
superior  to  physical  SIL's  and  much  more  difficult  to  de- 
«  termine.  Data  derived  by  aural  detection  of  selected  noise 

bands  in  wider  noises  are  thus  neither  or  greater  nor  less 
value  than  those  from  SIL  methods. 
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Figure  71—17.  Speech  Interference  Levels  (SIL’s) 
for  16  equal 1 y-s peech- interfering  noises.  All 
levels  are  relative  since  they  are  equated  on 
Noise  10,  Thermal  Noise.  The  psychophysical 
and  physical  SIL’s  at  the  top  are  averages  of 
levels  in  the  octave  bands  centered  at  250,  500, 
1000,  and  2000  c/s  i four—  bar.dj  or  500 ,  1000, 
and  2000  c/s  ( three- band ) .  The  physical  SIL’s 
at  the  bottom  are  based  on  averages  of  levels 
in  the  octave  bands  between  300  and  2400  c/s, 

300  and  4800  c/s,  and  600  and  4800  c/s .  The 
psycho  pkys  ical  SIL’s  fat  the  very  top)  are 
based  on  the  half-octave,  masked-threshold, 
difference  spectra  plotted  in  figures  71-1 
to  VI-16.  The  phys  ical  SIL’s  are  from  the  octave 
levels  printed  numerically  on  figures  1  to  16 
of  Section  III  (and  equated  to  Noise  10).  The 
broadband  threshold  data  are  from  the 
horizontal-bar  results  in  figures  VI-1  to  VI-16. 
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A  SPEECH  INTERFERENCE  NOISE  RATING  CONTOUR 


ABSTRACT 


An  attempt  has  been  made  to  show  the  similarities 
between  three  ostensibly  different  methods  of  rating  noises 
for  speech  interference.  The  three  basic  methods  are: 

Sound  Level  Meter  (SLM)  readings  using  various  frequency 
weighting  networks;  Noise  Criteria  Contours,  where  spectral 
peaks  of  noise  become  tangent  to  one  of  a  family  of  rating 
curves;  and  average-level  methods,  the  Articulation  Index 
(AI)  being  the  most  sophisticated  method  and  the  Speech 
Interference  Level  (SIL)  being  the  simplest  to  use. 

A  Speech  Interference  (SI)  curve  has  been  evolved 
which,  when  used  as  a  frequency  weighting  network  in  a 
SLM,  or  as  a  noise-rating  curve,  or  as  a  curve-fitting 
method  of  arriving  at  an  SIL,  greatly  reduces  the  spread 
of  scores  among  the  three  measurement  methods  when  rating 
the  speech-interfering  properties  of  certain  16  noises. 
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VU-1 


INTRODUCTION 


In  a  recent  series  of  papers  by  Klumpp  and  Webster 
(1,  2)  and  Webster  and  Klumpp  (3),  physical  and  psycho¬ 
physical  schemes  were  examined  that  purported  to  measure 
the  speech-interfering  aspects  of  noise.  Sixteen  diverse- 
spectrum  noises  were  adjusted  in  level  so  that  listeners 
hearing  monosyllabic  (Rhyme)  words  at  a  constant  level  of 
78  dB  from  a  loudspeaker  obtained  50  percent  word  intelli¬ 
gibility  scores.  Twenty-band,  and  5-  or  6-octave-band 
Articulation  Index  (AI)  calculations,  see  Kryter  (4),  pre¬ 
dicted  the  speech-interfering  properties  of  the  noises  very 
well,  see  Webster  and  Klumpp  (3).  However,  as  shown  by 
Klumpp  and  Webster  (2),  some  other,  and  simpler,  schemes 
worked  just  as  well;  for  example.  Speech  Interference  Level 
(3IL)  calculations,  see  Beranek  (5),  based  on  octaves 
centered  at  425,  850,  and  17G0  c/s,  or  SCO,  1000,  and  2000 
c/s.  The  A-weighting  and  Din  3  networks,  see  Peterson 
and  Bruel  (6),  of  a  Sound  Level  Meter  (SLM)  were  good,  but 
the  conventional  use  of  Noise  Criteria  (NC),  or  Alternate 
Noise  Criteria  (NCA),  see  Beranek  (7),  curves  did  not  work 
well.  However,  NC,  NCA,  and  ISO,  see  Janssen  (8),  curves 
worked  very  well  if  (1)  only  that  part  of  the  curves  center¬ 
ing  on  the  octaves  500,  1000,  and  2090  c/s  was  used,  and 
(2)  the  noise  spectra  were  allowed  to  "average  through"  a 
contour  and  not  just  touch  it  at  a  peak  value.  This  "average 
through"  or  average-curve-fitting  method  is  a  combination 
of  methods.  It  uses  contours  customarily  used  in  the  tangent- 
to-curve  method  to  arrive  at  the  equivalent  of  an  SIL  (average- 
level  method).  In  the  discussions  which  follow  it  will  not 
be  spelled  out  as  a  separate  method  but  will  be  considered 
as  just  another  average-level  method.  See  ref.  (3>  for 
more  discussion  on  this  combination  method. 

In  the  process  of  trying  all  possible  noise-rating 
schemes,  it  became  evident  that  there  were  essentially 
three  basic  ways  to  rate  the  speech  interference  properties 
of  noises.  And  although  the  three  basic  methods  differ  in 
how  they  operate,  the  best  of  each  method  was  pretty  good 
and  with  a  few  compromises  here  and  there  the  three  basic 
simple  methods  might  become  quite  comparable. 


METHODS  OF  RATING  SPEECH  INTERFERENCE 


The  three  basic  methods  of  rating  the  speech-inter¬ 
fering  properties  of  noise  are:  (1)  average-level  methods 
(the  AI  being  the  most  comprehensive,  universal,  and  the 
best  predictor,  but  the  SIL  doing  as  good  a  job  if  the  proper 
octaves  are  chosen  initially);  (2)  SLM  frequency  weighting 
networks  (A  and  Din  3  being  conclusively  better  than  either 
B  or  C);  and  (3)  tangent -to-curve  methods. 

As  stated  in  Klumpp  and  Webster  (2),  these  methods 
work  in  different  ways  and  it  is  pertinent  to  point  out  how 
they  differ.  The  simplest  in  concept,  but  the  worst  in  pre¬ 
dictive  ability,  is  the  tangent -to-curve  method.  In  this 
method,  only  the  noise  component  (peak)  that  first  touches 
a  generalised  noise -rating  contour  determines  the  rating. 

Any  pure  tone  component,  or  any  restricted  band  component, 
that  differs  drastically  from  its  surroundings  dominates 
this  rating. 

The  tangent -to-curve  method  may  be  expressed 
mathematically  as  follows: 

®  -  ten  log^  r?  (1) 

a  c 

where  SB  is  any  noise-rating  criteria  desired  such  as  SC, 

SCA ,  or  ISO-  and  h  is  a  frequency-  and  sound-pressure- 
dependent  weighting  factor  (represented  by  families  of  -  C, 

SCA,  or  ISO  contours);  p_  is  the  maximum  sound  pressure 
(the  noise  spectral  peak  that  first  touches  a  given  contour); 
and  PQ  is  a  reference  sound  pressure  (usually  0. 0002  micro- 
bar).  The  magnitude  of  SB  is  the  logarithm  of  the  weighting 
factor  at  the  frequency  cf  ps  and  the  maximum  noise  sound 
pressure. 

For  the  weighted-integration  (SLM  or  network) 
method,  the  indicating  instrument  following  the  weighting 
network  in  a  SLM  adds  components  powerwise,  i.e.  two 
equal  components  result  in  a  level  3  dB  greater  than  either 
level  individually;  nevertheless,  as  in  the  tangent-to-curve 
method,  a  single  component  10  dB  greater  than  all  its 
neighbors  essentially  determines  the  level.  The  frequency 
weighting  network  method  can  be  expressed  mathematically 
as  fellows: 

*  (2) 

27=1010^  | 

where  w  is  a  sound-level  weighting  reading;  ^  is  a  fre¬ 
quency-dependent  weighting  factor  determined  by  the  cefini- 
nition  of  — are  sound  pressures  in  contiguous  bands. 
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The  magnitude  of  w  is  the  logarithm  of  weighted  sums  of 
squared  band  sound  pressure. 

The  average -level  methods  (AI  and  SIL)  work  con¬ 
versely.  Whereas  the  tangent  and  network  methods  are 
determined  by  one  (tangent)  or  more  (network)  sound  pres¬ 
sure  peaks  and  give  readings  equal  to  (tangent)  or  greater 
than  (network)  the  highest  peak  sound  pressure  level,  the 
average-level  methods  yield  measures  lower  than  any  single 
peak  by  the  inclusion  of  lower  levels. 

The  average-level  method  can  be  expressed  mathe¬ 
matically  as  follows: 


SIL  =10  log 


l-n 


10 


}!n 


(3) 


kstf>P)Pl x  •  •  P\ 


where  SIL^  is  a  Speech  Interference  Level  of  n  bands, 
both  the  number  of,  and  the  location  of  the  bands  must  be 
specified;  <is  a  frequency  and  sound  pressure  level  de¬ 
pendent  weighting  factor  (but  for  SIL  calculations  in  the  past, 

«  has  been  equal  to  -  1);  ?x~  -  .?n  are  sound  pres¬ 

sure  levels  in  specified  bands.  As  a  consequence  of  the 
properties  of  logarithms  an  equivalent  SIL  result  can  be 
obtained  by  taking  the  arithmetic  mean  of  the  levels  and 
adding  a  constant  which  will  depend  on  the  weighting  —  a 
process  less  involved  than  taking  the  individual  differences 
and  averaging,  but  a  process  that  can  be  used  only  if  k  is 
neither  zero  nor  a  function  of  s;  i.  e.  all  SIL  differ  by  con¬ 
stants.  The  magnitude  of  an  SIL  is  the  logarithm  of  a  pro¬ 
duct  of  weighting  factors  plus  the  logarithm  of  a  harmonic 
mean  of  band  sound  pressure. 

As  an  example  of  how  these  three  methods  differ, 
consider  a  noise  that  had  equal  sound  pressure  level,  say, 

30  dB,  in  each  of  four  pertinent  octave  bands.  This  noise 
would  be  rated  80  by  the  average-level  method,  86  by  the 
integration  method,  and  80  by  the  tan gent-tc- curve  method. 

A  tonal  component  of  90  dB  in  one  band  would  change  the 
average  level  to  82. 5,  change  the  network  reading  to  91. 1 
and  the  tangent -to-curve  rating  to  90.  Two  90-dB  tonal 
components  in  adjacent  octaves  would  change  the  average 
level  to  85,  the  integration-method  reading  to  93. 4  and  the 
tangent  rating  to  90.  Two  90-dB  tonal  components  in  the 
same  band  would  change  the  average-level  to  83.  3,  the  inte¬ 
gration-level  to  93.  S,  and  the  tangent  rating  to  93.2. 

Licklider  and  Guttman  (9)  have  shown  that  tonal  components 
do  not  mask  speech  very  effectively  so  the  speech  interference 
properties  of  these  four  hypothetical  noises  would  be  approxi¬ 
mately  equal. 

To  summarize  the  example:  The  average-level 
measures  on  these  hypothetical  noises  varied  from  80  to 
85,  the  integrated  levels  from  86  to  93.6,  and  the  tangent 
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rating  from  80  to  93.  The  weighted  5-octave  band  AI  would 
act  much  the  same  as  the  average-level  measure  except  that 
weighted  averages  would  be  involved  and,  in  fact,  in  the 
examples  above,  changes  in  AI,  expressed  in  dB,  could  be 
as  large  as  5. 9,  depending  on  which  octaves  the  two  tones 
were  in  and  whether  the  level  of  speech  kept  the  speech-to- 
noise  (S-N)  within  the  0  to  30  dB  range.  To  express  AI  in 
dB,  recall  that  an  AI  of  0  corresponds  to  an  average  S-N 
differential  of  0,  an  AI  of  1  corresponds  to  a  30  dB  S-N,  an 
AI  of  0.  5  to  15  dB,  etc.,  so  ar.y  value  of  AI  can  be  expressed 
as  some  S-N  between  0  and  30. 

It  follows  from  the  above  discussion  and  from  the 
results  presented  in  Klumpp  and  Webster  (2)  that  on  any 
given  noise  the  integration  methods  will  give  the  highest 
numerical  ratings  (two  equal  peaks  together  add  3  dB),  the 
tangent-to-curve  method  next  (highest  peak,  or  peaks, 
determines  rating,  no  summing),  and  the  averaging  methods 
the  lowest  ratings.  This  is  strictly  true  only  if  the  frequency¬ 
weighting  networks  have  the  same  general  frequency  vs.  level 
shape  as  the  inverse  of  the  tangent-to-curve  rating  contours. 

Figure  1  shows  the  general  similarities  among  the 
common  SLM  weighting  networks  and  the  common  families 
of  noise-rating  contours.  Note,  for  example,  that  the  loud¬ 
ness,  see  Sevens  (10),  annoyance,  see  Kryter  (11)  and 
Kryter  and  Pearsons  (12),  and  NC  curves  are  quite  similar 
in  shape,  especially  at  the  low  frequencies  and  for  the  ”70” 
contours,  and  that  all  of  them  tolerate  less  low-frequency 
sound  than  the  NCA  carve.  It  should  be  pointed  cut  that  loud¬ 
ness  and  annoyance  calculations  are  not  Just  simple  tangent- 
to-curve  calculations.  They  are  different  inasmuch  as  com¬ 
ponents  ether  than  the  peak  spectral  component  are  taken 
account.  In  this  regard  they  act  more  like  SLM  weight¬ 
ing  networks-  Also  note  that  for  low-frequency  sounds  at 
least,  the  NCA-70  rating  curve  and  the  A  weighing  network 
curve  are  very  similar  in  shape.  Yeung  (13)  has  already 
made  a  case  for  using  the  A  weighting  network  for  rating 
sounds  because  cf  the  general  similarities  in  shape  between 
the  A  and  the  NC— 10  contours.  Young  (14)  has  cow  shown 
that  even  loudness  and  annoyance,  at  least  for  office  noises, 
are  fairly  well  predicted  by  the  A  weighting  network. 

Since  the  A  network  is  very  similar  in  shape  to  the 
inverse  of  the  NC  and/or  NCA  contours,  it  is  not  surprising 
that  the  ratings  assigned  to  the  16  equally  speech-interfer¬ 
ing  noises  in  table  2  of  Klumpp  and  Webster  (2)  are  higher 
in  magnitude  on  A  weighting  (83.  5  dB)  than  on  NCA  curve- 
limiting  (78. 7  dB),  and  that  both  are  larger  than  the  3IL 
(73. 7  dB).  (These  data  are  reproduced  in  columns  5,  8, 
and  13  of  table  1. ) 

The  absolute  magnitude  of  tbe  ratings  assigned  by 
variants  of  the  three  basic  methods  is  not,  however,  the 
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caost  important  fscet.  It  ;s  the  dispersion  of  the  ratings 
assigned  to  the  16  equally  speech-interfering  noises  that  is 
important.  In  this  regard,  according  to  Klumpp  and  Webster 
{2}  and  Webster  and  Klumpp  {3),  the  averaging  methods  are 
generally  superior  (the  AI,  511=,  or  average  curve-fitting), 
the  frequency-weighting  networks  next,  and  the  tangent-to- 
curve  methods  worst.  (This  can  be  observed  by  noting  that 
both  the  Ranges  and  Standard  Deviations  increase  from 
column  33  to  column  5  to  column  8  of  table  i.  > 
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TABU  VII-1.  VARIABILITY  ASSOCIATED  WITH 
VARIOUS  MEASUREMENT  PROCEDURES 
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PURPOSE 


It  is  the  purpose  of  this  paper  to  construct  a  Speech 
Interference  (SI),  frequency-weighting  curve  that  can  be 
used  (1)  to  calculate  a  weighted  SIL,  (2)  as  a  filter  in  a  SLM, 
and  (3)  as  a  substitute  for  the  NC  type  (NC,  NCA,  and  ISO) 
contour  at  the  70  dB  level.  The  curve  will  be  designed  to 
measure  only  the  speech-interfering  properties  of  noises. 

To  the  extent  that  speech  interference  is  the  determining 
factor  in  the  judged  loudness,  annoyance,  or  office  environ¬ 
ment  acceptability,  this  speech  interference  contour  will 
measure  that  quantity. 
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Specifically,  a  contour  will  be  developed  that  reduces 
the  dispersion  among  the  ratings  of  the  16  equally  soeech- 
interfering  noises  reported  by  Klumpp  and  Webster  (2). 

The  purpose  will  be  to  devise  methods  and  means  of  better 
estimating  the  speech-interfering  properties  of  noise  with¬ 
out  using  the  more  involved  AI  technique. 


SPEECH  INTERFERENCE 


On  the  basis  of  the  results  of  Klumpp  and  Webster 
(2),  the  guidelines  for  developing  a  speech  interference 
contour  at  the  noise  and  speech  levels  used  for  those  studies 
are  clear.  For  50  percent  scores  in  relatively  high  level 
noises  (as  compared  to  acceptable  offices),  the  frequency 
regions  of  the  noise  that  limit  the  speech  are  centered  at 
500,  1000,  and  2000  c/s.  If  the  speech  interference  contour 
is  to  be  used  as  a  filter  network  in  a  sound  level  meter, 
sound  of  frequency  below  300  c/s  and  above  3000  c/s  must 
be  discriminated  against.  Likewise,  when  used  as  a  tangent  - 
to-curve  determiner  the  same  frequency  cutoffs  must  be 
observed.  When  used  as  a  shaping  network  for  calculating 
an  SIL  or  average-curve-fitting  within  the  octaves  500, 

1000,  and  2000  c/s,  the  center  octave  needs  to  be  emphasized 
somewhat  more  than  the  others. 

With  these  general  guidelines  a  contour  labeled 
SI- 70  was  developed  as  shown  in  figure  2.  The  shape  of 
this  contour  is  determined  largely  by  the  levels  of  the 
limiting  spectra  of  the  Klumpp  and  Webster  (2)  noises  as 
shown  in  figure  5  of  Webster  and  Klumpp  (3).  Using  this 
SI-70  contour,  the  16  equally  speech-interfering  noises  of 
Klumpp  and  Webster  (2)  were  rated  as  detailed  in  table  1. 

All  of  the  ratings  in  table  1  (except  those  in  italics  which 
are  taken  directly  from  Klumpp  and  Webster  (2))  are  cal¬ 
culated  measures,  including  those  where  it  is  assumed  that 
the  inverse  of  the  SI-70  (and  labeled  A')  is  used  as  a  filter 
network  in  a  SLM. 

In  columns  1  and  2  of  table  1  are  listed  the  numbers 
of,  and  names  of,  the  16  noises.  For  comparison  reasons 
the  C  and  A  weighting  network  ratings  from  Klumpp  and 
Webster  (2)  are  shown  in  columns  3  and  5,  the  NCA  and 
ISO(R)  ratings  j  n  columns  8  and  9,  and  the  3-band  SIL  in 
column  13.  In  column  4  is  vhe  rating  that  would  result  if 
a  flat  (C)  weighting  were  used  in  a  SLM  but  bandpassed  to 
include  only  the  octaves  centered  at  500,  1000,  and  2000 
c/s.  This  column  is  labeled  C(R);  the  "R"  specified  here, 
as  everywhere  else  in  the  table,  "Restricted  Range.  " 


VII- 


<5 


130, 


120 


o 

s  r 


|ioo 


£ 

a>  90 
o 


id 


70 


5  «) 


o 

o 


50 


40 


OCTAVE  PASS  BANOS  C/S 
no  jm  no  moo  noo 


- 

— 

— 

— 

— 

— 

— 

— 

- , 

_ 

L 

■ 

■ 

■ 

H 

i 

■ 

■ 

■ 

■ 

■ 

■ 

■ 

a 

■ 

■ 

■ 

I 

■ 

1 

■ 

■ 

n 

■ 

■ 

a 

i 

■ 

■ 

■ 

K 

s 

R 

a 

■ 

i 

- 

I 

■ 

■ 

■ 

■ 

■ 

■ 

i 

■ 

■ 

■ 

■ 

125 


250 


500 

i — Hr 


1000  2000 
rrl - «— 


«a>  tooo 

-t— m4 


r-i-rtn - 

5  1 

10.000 


1 

luu 


1000 

FREQUENCY  C/S 


Figure  VII-S. 
Contours. 


Speech  Interference 


The  remaining  columns  in  table  1  are  ratings  the 
16  noises  would  get  if  the  SI-70  curve  were  used  as  a  new 
A  network,  namely  A’;  for  the  whole  frequency  range 
(column  6)  or  A'(R)  for  the  restricted  range.  Column  10 
lists  results  from  using  the  SI-70  contours  as  the  curve  for 
the  tangent -t.o-curve  method.  In  column  12  are  given  the 
measures  of  the  SI-70  curve  when  used  as  an  averaging 
curve  to  find  a  five -octave  SIL  (based  on  center  frequencies 
of  250,  500,  1000,  2000,  and  4000  c/s).  In  column  14  are 
the  results  of  restricting  this  averaging  procedure  to  the 
usual  restricted  band  (500,  1000,  and  2000  c/s). 

Below  each  column  are  two  measures  of  dispersion: 
the  range  (highest  minus  lowest  rating)  and  the  standard 
deviation;  and  the  mean  rating  on  the  16  noises.  The  rank 
order  refers  to  the  relative  smallness  of  the  standard 
deviation.  The  smaller  the  standard  deviation  the  better  is 
that  method  in  rating  the  noi  :s  to  be,  as  they  have  been 
adjusted  to  be  equally  speech-interfering. 

Some  of  the  data  in  table  1  are  plotted  in  figure  3 
and  reference  to  <.able  1  and  figure  3  makes  many  points 
very  evident.  F  or  example,  the  greatest  reduction  in  the 
variation  among  the  16  noises  occurs  by  merely  restrict¬ 
ing  the  bandwidth  to  the  octaves  centered  at  500,  1 000,  and 
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2000  c Is  (see  columns  3  and  4  and  the  two  uppermost  curves) 
The  new  SI-70  is  a  considerable  improvement  over  the  A 
weighting  when  used  as  a  filter  in  a  SL.M  (column  5  vs.  6). 
Restricting  the  bandpass  on  the  SI-70  as  a  filter  doesn’t 
improve  things  appreciably  (column  6  vs.  7). 


NUMBER  OF  NOISE 


Figure  VI1-3 .  Ratings  of  16  equally  speech- intei — 
fering  noises  by  various  methods .  Numbers  at  the 
left  edge  of  the  curves  refer  to  column  numbers  in 
tar  1 e  1 . 


When  used  as  a  Noise  Rating  curve  the  Si-70  contour 
(column  10)  is  markedly  superior  to  the  NCA  (column  8), 
the  ISO(R)  (column  9),  which  is  representative  of  both  the 
NC  and  NCA  curves  when  restricted  in  bandwidth,  and  to 
the  loudness  level  calculation  (column  11). 

When  used  as  an  averaging  curve  to  find  a  5 -band 
SIL,  the  SI-70  is  good  (column  12)  but  not  as  good  as  when 
finding  a  3-band  SIL  (column  14)  where  it  is  as  good  as  the 
conventional  method  (column  13).  This  latter  finding  is  to 
be  expected  since,  as  pointed  out  earlier,  in  SIL  calcula¬ 
tions  (as  long  as  you  stay  with  the  same  number  of  bands) 
the  weighting  should  have  no  effect  on  any  measure  of  dis¬ 
persion.  All  the  shaping  does  is  add  or  subtract  a  constant 
number  of  dB  from  the  original  straight-line  average. 

In  general,  then,  the  SI-70  curve  does  what  it  was 
designed  to  do:  It  provides  a  single  curve,  which  as  far  as 
predicting  the  speech-interfering  properties  of  relatively 
high  levels  of  noise,  (1)  makes  a  better  filter  network  than 
the  A  weighting,  (2)  makes  a  better  noise  rating  curve  than 
the  NC,  NCA,  and  ISO  curves,  and  (3)  can  be  used  as  an 
averaging  curve  to  find  an  SIL  that  is  equivalent  to  the 
3-band  preferred  frequency  SIL. 

A  comparison  among  the  best  methods  is  detailed 
at  the  bottom  of  figure  3.  In  this  plot  selected  columns  are 
replotted  from  table  1  (or  from  the  top  of  figure  3)  but  they 
are  now  equated  on  the  thermal  noise  (TN),  noise  10,  and 
all  ordinate  values  are  relative.  In  the  top  set  of  four  curves 
taken  from  columns  6,  10,  and  14  of  table  1,  with  the  AI 
calculation  from  reference  3  added  for  comparative  purposes, 
it  is  evident  that  the  S7  is  the  best  simple  predictor,  and 
that  the  new  SI  curve  w  jther  used  in  a  SLM  (column  6,  A’) 
or  as  a  noise  rating  curve  (column  10,  NCA')  is  slightly 
worse,  especially  for  low  frequency  noises  (noises  1,  2, 
and  3). 

However,  when  compared  to  the  old  A  weighting 
(column  5)  or  the  old  NCA  (column  8),  the  new  SI  (or  A’) 
and  NCA  ’  predictors  are  better,  especial!*'  high  fre¬ 
quency  noises  (noises  15  and  16). 

On  the  basis  of  the  theoretical  improvements  in 
measuring  speech  interference  shown  in  table  1  and  figure 
3,  an  SI  filter  was  constructed  and  added  as  an  external 
filter  to  a  Bruel  and  Kjaer  model  2203  SLM.  Figure  4 
shows  a  comparison  of  the  A  weighting  and  the  SI  weighting 
on  each  of  the  16  equally  speech-interfering  noises.  As 
would  be  predicted  from  table  J  or  figure  3,  the  difference 
between  the  two  weightings  is  greatest  on  the  high  frequency 
noises  (14,  15,  and  16)  and  next  greatest  on  the  low  fre¬ 
quency  noises  (1,  2,  and  3). 

Field  trials  on  U.  S.  Navy  ships  are  now  in  progress 
to  determine  if  the  new  SI  filter  is  sufficiently  superior  to 
the  A  weighting  network  in  rating  the  capabilities  for  speech 
communication  in  noisy  spaces  to  warrant  its  more  general  use. 


VH-12 


Figure  VII-4 .  Tracings  of  16  equally  speech— intei — 
fering  noises  as  seen  via  the  A  weighting  filter 
network  and  the  new  SI  weighting  filter « 
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SUMMARY 

This  paper  has  attempted  to  delineate  three  different 
methods  of  rating  noises.  The  three  basic  methods  are: 
Sound  Level  Meter  (SLM)  readings  using  various  frequency 
weighting  networks;  Noise  Criteria  Contours,  where  spectral 
peaks  of  noise  become  tangent  to  one  of  a  family  ox  rating 
curves;  and  average-level  methods,  the  Articulation  Index 
(AI)  being  the  most  sophisticated  method  and  the  Speech 
Interference  Level  (SIL)  being  the  simplest  to  use. 

Based  on  the  data  of  Klumpp  and  Webster  (2),  a 
Speech  Interference  curve  (SI-70,  figure  2}  was  evolved. 
When  this  SI-70  curve  was  used  as  a  frequency  weighting 
network  in  a  SLM,  or  as  a  noise-rating  curve,  or  as  a 
curve-fitting  method  of  arriving  at  an  SIL,  it  greatly  reduced 
the  spread  of  scores  among  the  three  measurement  methods 
over  the  16  equally  speech-interfering  noises  of  Klumpp 
and  Webster  (2). 
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GENERALIZED  SPEECH  INTERFERENCE 
NOISE  CONTOURS 


J.  C.  WEBSTER 

U.  S.  Nacy  Electronics  Laboratory,  San  Diego,  California 


Based  on  an  extensive  literature  review  of  the  effects  of  noise  on  speech  intelligibility, 
a  series  of  noise  rating  curves  are  developed.  These  Speech  Interference  (SI)  contours 
are  intended  to  bridge  the  gap  between  (1)  Noise  Criteria  (NC)  and  Alternate  Noise 
Criteria  (XCA)  curves  used  to  rate  the  suitability  of  offices,  and  (2)  2  Speech  Ir>ter- 
fswsce  (SI)  noise  rating  curve  that  predicts  die  effects  af  higher  level  noises  on  speech 
intelligibility.  The  highest  SI  contour  (SI-80)  has  a  minimum  2t  an  octave  level  of  SO 
dB  at  800  cps  and  is  steeply  -loped  both  above  and  below  800  cps.  The  lower  level  SI 
contours  have  ramHpi  at  increasingly  higher  frequencies  and  have  steep  slopes  for  fre¬ 
quencies  below  the  TTmwma  but  gradually  level  off  at  the  frequencies  above  the  mniima. 

In  another  paper,  Webster  (1964)  has  developed  a  speech  interference  con¬ 
tour  that  best  predicted  the  speech-interfering  properties  of  the  16  equally 
speech-interfering  noises  described  by  Klumpp  and  Webster  (1963).  It  is  the 
purpose  of  this  paper  to  generalize  this  single  contour  into  a  set  of  contours 
at  higher  and  lower  decibel  levels.  Ideally,  these  contours  will  extend  the  upper 
range  of  Beraneks  (1957)  Noise  Criteria  (NC)  and  Alternate  Noise  Criteria 
(XCA)  curves  for  rating  .  .  the  maximum  noise  level  at  which  office  per¬ 
sonnel  feel  they  can  accomplish  tbezr  duties  without  loss  of  performance."' 
Working  spaces  exist  that  exceed  Beraneks  (1957)  maximum  contour  (NC  or 
NCA-70)  and  that  very  often  exceed  his  recommended  maximum  of  NC-55. 
The  rationale  for  developing  these  contours  is  that  in  certain  spaces,  certainly 
some  shipboard  areas,  noise  levels  exceed  XC-55  but  wesk  must  and  does 
continue,  including  voice  communica  Hons.  In  these  areas  toe  major  criterion 
must  be  acceptable  speech  intelligibiHty  with  little  or  no  regard  for  loudness, 
annoyance;  or  comfort.  To  rate  these  spaces,  therefore,  contours  based  on 
tnafort  and  speech  ccmmunkaticn  performance  must  drop  the  comfort  (loud¬ 
ness  annoyance)  base  the  rating  and  eventual  acceptance  only  on  those 
aspects  os  noise  tint  affect  speech  rntelrtgjhqlity- 

The  contours  to  be  developed  then  will  ideally  bridge  the  gap  between  (1) 
Beraneks  (1957)  NC  and  XCA  curves  that  rate  rooms  on  aS  aspects  of  noise, 
and  (2)  Webster’s  (1964)  speech-interfering  (SI)  ccrrtcur  developed  to  predict 
the  effects  of  noise  upon  speech  inteJbgibiKty.  This  will  be  accomplished  by 
utilizing  the  results  cf  an  extensive  literature  serves-  of  the  effects  of  noise  and 
frequency  bandwidth  os  speech  intelligibility. 


LITERATURE  SURVEY 

No  discussion  of  the  effects  of  various  frequency  regions  of  noise  upon  the 
masking  cf  speech  would  be  complete  without  considering  the  data  of  Miller 
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(1947).  He  used  a  broad-band  (20-4  000  cps)  noise  and  eight  narrow  bands 
of  noise  covering  the  range  from  135  cps  to  4  000  cps.  He  found  that  as  com¬ 
pared  to  the  masking  done  by  the  broad-band  noise,  (1)  the  low-frequency 
noise  bands  did  virtually  no  masking  at  low  levels  and  were  very  good  maskers 
at  high  levels,  (2)  the  high-frequency  bands  of  noise  were  very  effective 
maskers  at  low  levels  but  didn’t  mask  much  more  as  the  levels  increased,  and 
(3)  at  moderate  to  high  levels,  bands  below  1500  cps  and  especially  those 
below  1 100  cps  masked  speech  considerably  better  than  those  above  1 300  cps. 
His  results  can  be  summarized  by  saying  that  as  the  level  of  noise  increases, 
or  as  the  S-N  (Speech-to-Noise)  differential  decreases,  the  masking  effective¬ 
ness  changes  from  higher  frequency  bands  to  lower  frequency  bands. 

Millers  (1947)  lowest  band  was  from  135  to  400  cps.  Dreher  and  Evans  (1960- 
61)  found  that  a  masking  band  of  noise  from  50  to  300  cps  never  did  decrease 
the  intelligibility  of  speech,  even  at  levels  where  a  band  from  600  to  4  800 
cps  masked  speech  phrases  completely.  They  did  find,  however,  that  the  50- 
300  cps  band,  when  added  to  the  600-4800  cps  band,  caused  an  additional 
amount  of  deterioration  in  speech  intelligibility. 

Pickett  and  Kryter  (1955),  using  sloped  broad-band  noises  to  mask  speech, 
found  some  evidence  to  support  Millers  (1947)  findings  based  on  narrow 
bands  of  masking  noises.  They  found  (see  Figure  3  in  Pickett  and  Kryter, 
1955)  feat  Tow-frequency  (LF)  noise”  (”.  .  .  —7  dB/octave  at  fee  low  fre¬ 
quency  end  to  —12  dB/octave  at  the  high  end  ...  [as  measured  in  octave- 
band,  not  spectrum,  levels]")  was  not  very  effective  in  decreasing  high  levels 
of  intelligibility  but  quite  effective  in  further  decreasing  low  levels.  Similarly, 
they  found  (Figure  4,  Pickett  and  Kryter,  1955)  that  ‘high-frequency  (HF) 
noise’  (*. . .  slope  of  fee  HF  .  . .  noise  spectrum  increased  from  -}-3  dB/octave 
to  -}-5  dB/octave  as  frequency  increased  .  .  .”)  was  quite  effective  in  de¬ 
creasing  high  levels  of  intelligibility  but  relatively  less  effective  at  decreasing 
lower  levels. 

Egan  and  Wiener  (1946),  doing  the  reverse  of  MiUer  (1947),  found  what 
band  widths  of  speech  were,  important  to  make  speech  intelligible  in  broad¬ 
band  noises-  Their  data,  which  are  summarized  in  Table  1,  show  feat  when 
speech  is  bandpass-filtered  in  broad-band  noise,  there  is  a  frequency  some¬ 
where  between  1 100  and  2  000  cps  that  is  essentially  the  center  or  important 
frequency.  As  fee  width  of  the  speech  passband  is  increased  more  or  less 
symmetrically  around  these  center  frequencies,  intelligibility  likewise  increases. 
In  general,  fee  wider  fee  band  in  octaves  fee  better  the  intelligibility.  For 
example,  if  2  600  eg.  is  subtracted  from  fee  top  end  of  the  55043  500  cps  hand, 
the  intelligibility  of  the  new  550-3900  cps  band  drops  a  measurable  amount 
(and  the  total  bandwidth  decreases  from  3-56  octaves  to  2.S2  octaves).  If 
fee  lower  end  of  the  hand  is  extended  downward  by  210  cps  (340  to  3900 
cps),  fee  intelligibility  ana  the  bandwidth  in  octaves  is  again  approximately 
equal  to  fee  original  550  to  6500  cps  band  even  though  the  bandwidth  in 
cycles  has  been  reduced  from  3 950  to  S560  cps. 
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Table  1.  Intelligibility  versus  Bandwidth  (Iron*.  Egan  &  Wier-er,  1946). 


Word  IrJedigibihty 

%  at  high  %  at  low  p  . 

S-N  values  S-N  values  Rel 

Band 

Extent 

in  cps 

Band 

Center 

in  cps 

3andtddlh 

in  cps  in  octaves 

88 

63 

1 

130-9200 

1093 

9070 

6.14 

81 

46 

2a 

550-6500 

1920 

5  950 

356 

79 

50 

2b 

340-3  900 

1150 

3560 

352 

77 

42 

3 

550-3  900 

1460 

3  350 

252 

70 

30 

4 

550-2500 

1 170 

1950 

2.18 

65 

25 

5 

870-3  900 

1840 

3  030 

2.16 

S3 

18 

6a 

870-2500 

1480 

1630 

152 

44 

22 

6b 

550-1500 

90S 

950 

1.46 

42 

12 

7 

1300-3100 

2007 

1800 

1-25 

22 

10 

8 

870-1590 

1120 

630 

0.79 

20 

5 

9a 

1300-1900 

1580 

600 

055 

18 

6 

9b 

1800-2500 

2120 

700 

0.47 

It  is  interesting  to  note  that  in  the  three  cases  where  the  bandwidths  in 
octaves  were  approximately  equal  (2a  versus  2b,  4  versus  5,  and  6a  versus  6b) 
the  bandwidth  with  the  lower  center  frequency  was  more  intelligible  at  low 
S-N  values  and  in  two  of  these  cases  (2  and  6)  the  reverse  was  true  at  high  S-N 
values. 

So  the  intelligibility  of  bandpass-filtered  speech  in  broad-band  noise  increases 
as  the  width  of  the  speech  band  increases  in  units  of  octaves  around  a 
frequency  of  about  i  500  cps.  This  center  frequency  can  be  as  low  as  906  cps 
for  bad  conditions  of  speech  in  noise  to  a  frequency  as  high  as  1 920  cps  for 
good  conditions. 

Some  generalizations  seem  evident  from  the  data  of  Miller  (1947)  and  Egan 
and  Wiener  (1946).  For  speech  to  be  very  intelligible  (80%  nonsense  syllable 
scores)  in  broad-band  noise,  the  speech  passband  should  be  about  3J>  octaves 
wide  and  centered  somewhere  between  1 100  and  2  000  cps.  Twenty  per  ™>nt 
nonsense  syllable  scores  remain  for  bandwidths  between  05  and  0.75  octaves 
if  centered  between  1 100  and  2  000  cps.  Low  frequency  bands  of  noise  (below 
1 100  cps  and  in  bands  of  1.6  octaves  or  less)  do  not  mask  broad-band  speech 
well  until  they  become  relatively  loud.  High  frequency  bands  of  noise  (above 
900  cps  and  in  bands  of  0.75  octaves  or  less)  mask  broad-band  speech  some¬ 
what  at  very  low  levels  but  do  not  mask  speech  appreciably  more  as  the 
noise  levels  increase.  At  low’  intelligibility  levels  (less  than  40%  PB  semes), 
narrow  bands  of  frequencies  above  900  cps  (especially  above  1300  cps)  do 
no  additional  masking  whereas  narrow  bands  of  frequencies  below  1 100  cps 
(especially  below  700  cps,  but,  according  to  Dreher  and  Evans,  1960-61,  cot 
below  300  cps)  do  an  appreciable  amount  of  masking. 

Can  these  trends  be  found  in  the  data  of  others?  The  trends  being  that  (1) 
a  sufficient  bandwidth  for  speech  in  noise  is  3.-5  octaves  from  340  to  3900  or 
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550-8  500,  the  center  or  important  frequency  being  between  1 100  and  2  000 
cps;  (2)  small  amounts  of  high-frequency  noise  deteriorate  speech  intelligibil¬ 
ity  somewhat  and  greater  amounts  do  no  further  damage;  and  (3)  small 
amounts  of  low  frequency  noises  won’t  deteriorate  speech  intelligibility  at  all 
but  greater  amounts  can  obliterate  speech  intelligibility. 

The  Articulation  Index  (AI),  which  is  a  measure  that  takes  into  account  the 
difference  in  the  spectrum  level  of  speech  and  of  noise  over  20  contiguous 
frequency-limited  bands,  each  of  which  contributes  equally  to  speech  intelligi¬ 
bility,  yields  a  number  that  is  a  function  of  speech  intelligibility.  The  AI  as 
calculated  by  the  Beranek  (1947)  method,  see  Kryter  (1962a),  confirms  die 
bandwidth  requirements  for  speech  in  noise;  that  is,  the  340-3900  cps  band 
contains  bands  2  through  18  of  the  20  bands  and  the  550-6  500  cps  band  con¬ 
tains  bands  4  through  20.  Both  bands  include  17  of  the  possible  20  bands  and 
would  therefore  yield  an  AI  score  of  0.85,  which,  according  to  Kryter  (1962b), 
would  permit  a  PB  word  score  of  95%.  Kryter  (1962b,  Figure  1)  also  shows  that 
speech  in  the  bands  0-600,  and  1 200-2  400  cps  is  just  as  intelligible  in  the  quiet 
as  is  speech  in  the  bands  0-600,  1 200-2  400,  and  4  800-9  600  cps.  This  indicates 
that  speech  frequencies  above  4  800  cps  do  not  contribute  appreciably  to 
intelligibility  if  there  is  sufficient  speech  energy  in  frequencies  below  2400 
cps.  In  the  same  reference  Kryter  (1962b)  shows  that  any  bandwidth  reduction 
within  the  1 200-2  400  cps  band  reduces  intelligibility  appreciably,  especially  if 
the  0-600  cps  band  of  speech  is  also  eliminated.  Kryter  (1962b)  found,  as  did 
Egan  and  Wiener  (1946),  that  nonsense  syllable  intelligibility  stays  relatively 
high  (60%)  if  only  speech  in  the  1 200-2  400  cps  band  is  passed.  Unlike  Miller 
(1947),  Kryter  (1962b)  found  that  a  band  of  high  frequency  noise  (2400-3400 
cps)  does  decrease  intelligibility  with  increasing  level  rven  if  the  speech  has 
already  been  low-passed  at  1700  cps.  But  like  Miller  (1947),  he  found  that 
increasing  levels  of  low-passed  noise  (200-1 100  cps)  decreased  intelligibility 
to  lower  levels  than  did  the  high-frequency  band  of  noise  even  though  the 
speech  was  already  high-passed  at  1 700  cps. 

In  another  paper,  Kryter  (1960)  presents  more  evidence  on  the  important 
frequencies  for  speech  when  masked  by  noise.  He  was  locking  for  three  bands, 
each  500  cps  wide,  that  would  pass  speech  with  maximal  intelligence  (and 
naturalness).  He  found  that  the  lowest  band  should  be  centered  at  500  cps  (to 
750),  the  next  at  1 750  ±250  cps,  and  the  third  from  2  500  cps  (to  3  000).  In 
comparative  tests  using  bands  centered  at  500,  1 500,  and  2  500  cps  he  found 
the  intelligibility  to  be  better  than  when  using  contiguous  bands  of  1500  cps- 
width,  geometrically  centered  at  either  400,  or  1 000,  or  1 580  cps.  But  he  also 
found  that  among  the  1 500  cps-wide  bands  the  one  from  100  to  1 600  (center 
freq.  =  400)  was  appreciably  better  than  the  higher-centered  one  at  500-2  000 
cps  which  was  in  turn  better  than  the  band  from  1000  to  2500  cps.  The 
superiority  of  the  100-1 600  cps  band  wa»  especially  evident  at  the  least 
favorable  (zero)  speech- to-noise  condition.  This  low  band  is  four  octaves  wide 
(as  compared  to  2,  and  1.32  octaves  for  the  higher  bands),  but  if  the  data  of 
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Egan  and  Wiener  (1946)  are  correct,  the  lowest  1.75  octaves,  if  not  the  lowest 
two  octaves,  could  undoubtedly  have  been  eliminated  without  any  undue  loss 
in  intelligibility. 

Pollack  (1948)  also  has  some  data  which  show  that  the  intelligibility  of 
speech  in  noise  is  actually  increased  slightly  if  the  speech  frequencies  below 
350  cps  are  eliminated.  He  also  confirms  that  speech  frequencies  above  3  950 
cps  add  no  more  to  intelligibility.  Pollack  (1948),  who  selectively  high-  and 
low-passed  speech  in  noise  backgrounds,  found  a  “.  .  .  shift  in  relative  contri¬ 
butions  to  intelligibility'  from  the  low  frequencies  at  low  intensity  levels  [and 
low  intelligibility  scores]  to  the  high  speech  frequencies  at  high  levels.  .  .  .” 
He  found  this  shift  .  .  by  noting  the  frequencies  at  which  one-half  of  the 
maximal  contribution  to  the  articulation  index  was  made.”  These  midpoints 
are,  according  to  Pollack,  .  .  800,  1 010,  1 300,  1  430,  and  1  620  cps  ...  for 
levels  of  +10—,  +20—,  +30—,  +40—,  and  +50— db  orthotelephonic  gain, 
respectively.” 

Dyer  (1962),  doing  the  reverse  of  Pollack  (1948),  namely,  filtering  the  noise 
around  a  broad-band  speech  signal,  found,  like  Pollack,  that  as  the  speech-to- 
noise  differential  increased,  the  frequency  that  divided  the  high-  and  low- 
passed  noises  into  equal  speech-interfering  increments  increased  from  about 
1 000  to  almost  2  000  cps. 

If  any  generalizations  can  be  made  from  the  literature  cited,  they  are: 
Speech  frequencies  below  about  350  cps  and  above  3900  cps  are  relatively 
unimportant  to  the  intelligibility  of  speech  in  noise  (Egan  and  Wiener,  1946; 
Kryter,  1960;  and  Pollack,  1948);  according  to  Dreher  and  Evans  (1960-61), 
noise  frequencies  below  300  cps  are  very  ineffective  in  masking  speech  at 
tolerable  listening  levels  unless  higher  noise  frequencies  are  also  present; 
according  to  Miller  (1947),  noise  bands  above  2400  cps  are  very  ineffective  in 
masking  speech;  Kryter  (1960)  states  that  the  most  important  narrow  bands  of 
speech  energy  are  centered  at  500  (to  750  cps),  1 750  ±250  cps,  and  2500  (to 
3000  cps);  Kryter  (1962b)  also  states  that  any  decrease  in  speech  bandwidth 
in  the  1200  to  2400  cps  band  reduces  intelligibility;  the  most  important  mid¬ 
frequency  in  broad-band  speech,  according  to  Egan  and  Wiener  (1946),  is 
somewhere  between  1 100  and  2  000  cps  and  the  bandwidth  required  for  high 
intelligibility  is  about  3.5  octaves;  Egan  and  Wiener  (1946)  also  state  that  at 
good  speech-to-noise  conditions  (at  high  levels  of  speech  intelligibility)  the 
important  broad-band  center  frequency  is  around  2  000  cps  and  frequencies  as 
high  as  6500  cps  may  be  important;  as  the  speech-to-noise  conditions  deteri¬ 
orate,  the  important  mid-frequency  shifts  down  to  around  1000  cps  and  fre¬ 
quencies  above  3  900  cps  are  ineffective  (Egan  and  Wiener,  1946;  Kryter,  1960; 
Pollack,  1948;  and  Dyer,  1962). 
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SPEECH  INTERFERENCE  (SI)  CONTOURS 

The  remaining  SI  contours  (SI-50,  60,  and  SO)  in  Figure  1  were  drawn  relative 
to  the  SI-70  contour  on  the  basis  of  tire  above  observations,  which  are  quanti¬ 
tative  as  regards  frequency  but  only  qualitative  as  regards  sound  pressure 
levels.  The  SI  contours  show  (1)  a  gradual  shifting  from  a  minimum  of  SCO 
cps  for  SI-80  to  2000  cps  for  SI-50,  (2)  an  increasing  disregard  of  high  fre¬ 
quency  noise  components  from  NCA-40  through  SI-50,  80,  and  70  to  SI-80, 
(3)  a  sudden  disregard  of  low  frequency  noise  components  from  NCA-40  to 
SI-50,  then  increasing  concern  for  low  frequency  noise  for  the  contours  SI-60, 
70,  and  80.  These  contours  are  developed  on  the  basis  that  for  levels  of  noise 
below  the  NC-30  contour,  comfort,  annoyance,  and  purely  aesthetic  values 
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govern  the  use  of  a  room.  At  NCA-40,  Beranek  (1957)  states  that  all  due  allow¬ 
ance  is  made  for  the  difference  between  loudness  and  speech  interference, 
•nd  above  NCA-40  the  environment  is  admittedly  adverse  and  speech  inter¬ 
ference  alone  is  hypothesized  to  be  the  determiner  of  acceptance. 

In  this  regard  it  is  interesting  to  note  that  in  specifying  the  comfort  of  air¬ 
craft  cabins  (propeller-driven),  Lippert  and  Miller  (1951)  define  as  “ideally 
quiet”  a  noise  spectrum  that  becomes  tangent  to  the  SI-70  contour  (between 
500  and  1 000  cps).  This  spectrum  is  at  least  15  dB  above  Beraneks  (1957)  NCA 
contour  of  55  which  he  describes  as  “Very  noisy;  office  environment  unsatis¬ 
factory;  . . .  Not  recommended  for  any  type  of  office.”  Here  is  a  case,  and  there 
are  others,  where  the  adaptability  level  of  humans  comes  to  their  aid.  A  noise 
level  that  makes  offices  “unsatisfactory”  is  15  dB  less  intense  than  a  noise 
judged  to  be  “ideally  quiet”  in  airplane  cabins.  Lippert  and  Miller  (1951)  define 
a  second  contour  exactly  20  dB  higher  as  “quasi-comfortable.”  This  letter  level 
is  35  db  i’:X)ve  Beraneks  “unsatisfactory  office.” 

It  is  this  adaptability  feature  of  human  behavior  that  gives  rise  to  the 
rationale  behind  the  discontinuity  in  the  contours  between  NCA-40  (where 
comfort  is  of  importance)  and  SI-50,  and  on  through  the  SI-60,  70,  and  80 
contours,  where  the  important  aspect  of  the  noise  is  its  speech  interference 
properties,  not  its  loudness,  its  annoyance,  nor  its  habitability  and  comfort 
properties. 


SUMMARY 

Because  people  do  work,  travei,  and  even  go  for  entertainment  (night  clubs) 
in  environments  where  noise  levels  greatly  exceed  “satisfactory  office  standards,” 
it  was  thought  desirable  to  extend  Beraneks  (1957)  Noise  Criteria  (NC)  curves 
to  higher  levels.  In  this  extension,  aspects  of  comfort,  loudness,  and  annoyance 
are  not  as  important  as  aspects  of  speech  intelligibility.  Therefore,  a  series  of 
Speech  Interference  (SI)  Contours  have  been  developed  based  on  what  is 
known  from  the  literature  on  the  intelligibility  of  speech  in  noise. 

These  SI  contours,  which  actually  constitute  the  summary  of  this  paper, 
show  (1)  an  increasing  disregard  for  high  frequency  noise  components  as  the 
noise  increases  from  levels  of  40  dB  to  80  dB  (as  estimated  by  an  A-weighting 
network  of  a  sound  level  meter),  (2)  a  sudden  disregard  of  low  frequency 
noise  components  as  the  noise  level  passes  60  dB(A)  and  an  increasing  concern 
again  for  A  levels  above  80  dB,  and  (3)  a  shifting  of  the  major  concern  for  noise 
components  centered  at  2000  cps  for  A  levels  of  40  dB  and  below  to  com¬ 
ponents  centered  at  1 000  cps  and  below  in  noises  with  A  levels  of  70  dB  and 
above. 

The  opinions  and  assertions  contained  herein  are  the  private  ones  of  the  writer,  and  are  not 
to  be  construed  as  oficial,  or  a*  reflecting  the  views  of  the  Navy  Department  or  the  naval 
service  at  large. 
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Relations  between  Speech-Interference  Contours  and 
Idealized  Articulation-Index  Contours 


J.  C.  Webster 

U.  S.  Nary  Htciromcs  Laboraiory,  San  Diego,  Calijemia 
(Received  22  November  1963) 


A  comparison  is  made  between  speech-interference  (SI)  contours  [developed  from  the  speech-interfering 
properties  of  a  representative  sample  of  industrial  (Navy)  noises]  and  a  set  of  articulation-index  (AI)  con¬ 
tours  lased  on  theoretical  noise  spectra  encompassing  the  most-extreme  spectra  found  among  real  noises. 
At  a  lerel  of  roughly  70  dB  [as  based  on  a  speech-interference  level  (SIL)  or  decibel  average  over  the  octaves 
contend  at  500, 1000,  and  2000  cps],  the  SI  and  AI  contours  agree  very  well.  At  lesser  levels  of  noise,  the  SI 
and  AI  contours  purposely  diverge  because  the  SI  contours  were  developed  to  bridge  the  gap  between  noise- 
criteria  (XC)  and  altemate-noise-criteria  (NCA)  curves  (developed  by  Beranek  to  rate  both  the  annoying 
and  speech-interfering  aspects  of  office  noises)  and  the  bask  SI70  (SI  at  70  dB)  cane-  The  complete  set  of 
AI  contours  points  up,  as  do  the  SI  contours,  that  as  the  ambient  noise  increases  the  importance  or  pivotal 
frequency  shifts  downward  from  around  2000  to  around  1000 cps.  The  pivotal  frequency  is  the  frequency  that 
divides  the  speech  bandwith  into  two  halves,  each  of  which  contributes  equally  to  the  total  intelligibility. 
Some  potential  uses  of  the  bask  SI  contour  (SI70)  as  a  filter  network  in  a  sound-level  meter  are  discussed. 


INTRODUCTION 

N  an  attempt  to  summarize  the  results  of  studies  on 
equally  speech-interfering  noises,1-5  a  speech-inter¬ 
ference  (SI)  contour  was  developed.5 

This  SI  contour,  a  U-shaped  contour  centered  at 
70  dB  at  1000  cps,  shows  the  levels  of  noise  in  octave 
bands  that  best  summarize  the  spectrum  and  levels 
of  the  original  16  equally  speech-interfering  noises. 

In  order  to  generalize  this  contour  to  lesser  levels  of 
noise,  two  contours  centering  on  levels  of  60  and  50  dB 
were  interpolated*  between  this  countour  and  the  40-dB 
altemate-noise-criteria  (NCA)  contour  of  Beranek.5 
And,  to  complete  the  generalization,  a  contour  centered 
on  80  dB  was  extrapolated.*  The  generalizations  repre¬ 
sent  an  attempt  to  integrate  the  data  on  the  16  equally 
speech-interfering  noises  with  other  published  data  on 


1 R.  G.  Klumpp  and  J.  C.  Webster,  “Physical  Measurements  of 
Equally  Speech-Interfering  Navy  Noises,”  J.  Acoust  Soc.  Am. 
35, 1328-1338  (1963). 

*  J.  C  Webster  and  R-  G.  Klumpp,  ''Articulation  Index  and 
Average  Curve-Fitting  Methods  of  Predicting  Speech  Interfer¬ 
ence,"  J.  Acoust  Soc.  Am.  35, 1339-1344  (1963). 

*  J.  C.  Webster,  “A  Speech  Interference  Noise  Rating  Coatoui” 
(to  be  published). 

*  J.  C.  Webster,  “Generalized  Speech  Interference  Noise  Con¬ 
tours,”  J.  Speech  Sc  Hearing  Res.  7, 133-140  (1964). 

‘  L.  L.  Beranek,  “Revised  Criteria  for  Noise  in  Buildings," 
Nose  Control  3,  No.  1, 19-27  (1957). 


the  masking  effects  of  noise  on  speech.  These  other 
studies  are  cited  and  interpreted  in  Ref.  4. 

All  of  these  SI  contours,  labeled  SI80,  70, 60,  and  50 
arc  reproduced  in  Fig.  1.  The  SI  contours  at  80  and  70 
dB  represent  the  maximum  octave-band  levels  of 
equally  speech-interfering,  quasi  steady-state  noises 
whose  average  level  in  the  octaves  centered  at  500. 
1000,  and  2000  cps  is  approximately  80  and  70  dB, 
respectively.  The  SI  contours  at  60  and  50  dB  represent 
a  compromise  between  maximum  octave-band  noise 
levels  for  equally  speech-interfering  noises  and  for 
noises  acceptable  for  office  environments,  where  factors 
such  as  loudness  and  annoyance  as  well  as  speech 
intelligibility  are  important-  The  complete  set  of  SI 
countours  bridges  the  gap  between  speech-intelligibility 
predictors — especially  the  articulation  index  (AI) — and 
the  noise-  and  altemate-ncise-criteria  (NC  and  NCA) 
curve  that  rate  the  acceptability  of  work  spaces  where 
speech  intelligibility  is  but  one  of  the  important  factors. 

The  rationale  for  the  SI  contour  at  the  70-dB  level 
(and,  subsequently,  the  generalized  SIS0-,  60-,  and 
50-dB  contours)  is  detailed  in  Refs.  1-3,  where  it  was 
pointed  out  that  there  are  three  wavs  in  which  noises 
have  been  rated  in  the  past:  (1)  by  measurement  with 
frequency-weighting  networks  in  sound-level  meters, 
(2)  by  fitting  the  peaks  of  {dotted  noise  spectra  to 
1662 
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families  of  noise-rating  curves,  and  (3)  by  band-level 
averaging.  It  has  been  shown*  that,  if  the  SI70  contour 
is  used  as  a  frequency-weighting  network  (where  it 
need  be  inverted),  as  a  noise-rating  curve,  and  as  a 
weighting  curve  for  calculating  speech-interference 
levels  (SIL’s),  the  three  methods  give  very  similar 
results  in  predicting  the  speech-interfering  properties 
of  the  !6  equally  speech-interfering  noises  of  Klumpp 
and  Webster.1 

This  paper  shows  the  relationships  between  the  SI 
contours  and  a  set  of  generalized  AI  contours,  and 
discusses  some  of  the  merits,  limitations,  and  uses  of 
both  the  SI  and  AI  contours. 

L  DEVELOPMENT  OF  IDEALIZED  ARTICULATION- 
INDEX  CONTOURS 

In  a  recent  paper  by  Cavanaugh  dal.*  (CFHW)  the  es¬ 
sentials  of  the  AI  are  displayed  in  a  manner  that  makes 
calculations  and  manipulations  relatively  easy:". . .  the 
useful  speech  signal  is  shown  as  a  dot  field  beginning  at 
200  cps  and  extending  to  6000  cps.  Each  dot  signifies  a 
possible  contribution  to  the  articulation  index.  The 
field  is  30  dB  ‘high’  and  the  greatest  density  of  dots  is  at 
2000  cps.  The  dot  field  is  drawn  for  an  average  talker 
using  ‘conversational’  speech  effort”  (Ref.  6,  p.  481). 

The  CFHW  plot,  which  they  plotted  in  third-octave 
bands,  has  been  redrawn  on  the  basis  of  octave  bands 
in  1^3  2.  Also  hr  Fig.  2  are  drawn  a  series  of  contours, 
in  5-dB  steps,  of  the  idealized  spectrum  of  a  thermal 
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Fig.  1.  SI  coatocns  (radading  the  NCA40  ana  NC30  curves 
oIEtnau'j. 


*  iV.  J.  Cavansugk,  W.  R_  FzrreH,  P.  TV.  HirtJe,  and  B.  G. 
Walters,  *^p«eh  Privacr  la  Buildings,”  J.  Acoust.  See.  A ax 
K  475-492  (1962). 
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Fig.  2.  AI  speed]  rtukra  for  “ooaveisalloasl-level”  speech-  The 
number  of  dots  in  each  J -octave  band  signifies  the  relative  con¬ 
tribution  of  speech  in  that  band  to  the  AL  A  series  of  ideahzed 
thermal  noises  with  — 6-dB/oct  spectra  arc  draws  in  5-dB  steps. 
The  number  of  dots  above  each  noise  contour  is  proportional  to 
the  AI  of  conversational  4evd  speech  in  that  level  of  tsasc.  (After 
Cavanaugh  ef  cF,‘  Fig.  5] 

noise  with  a  minus  6-dB/oct  spectrum  slop^  The 
number  of  dots  between  each  two  adjacent  contours  is 
proportional  to  the  difference  in  AI.  It  will  be  noted 
that  at  an  over-all  level  of  88  dB  the  AI  equals  zero. 
The  8S  dB  is  derived  by  assuming  that  the  noise  band 
includes  the  octaves  centered  at  125,  250,  500.  1000, 
2000,  and  4000  cps  and  that  the  over-all  level  of  a  TX6 
noise  is  3  dB  greater  than  the  level  in  the  octave  with 
the  greatest  level — in  tins  case,  the  125  cps  octave.  AI’s 
of  0.2,  0  j,  0.8,  and  1.00  occur  at  over-all  levels  of  77, 
68,  58,  and  48  dB,  respectively. 

A  similar  relation  between  over-all  level  and  AI  can  be 
determined  by  using  the  method  shown  in  Fig.  2  with 
idealized  noise-spectrum  slopes  of  dz6,  —9,  and  — 12 
dB/oct.  From  this  type  of  information.  Fig.  3  has  been 
compiled.  Figure  3  is  a  plot  of  the  AI  versus  over-all 
noise  level  for  various  theoretical  noises.  Three  different 
over-all  bandwidths  of  noises  are  plotted  on  the 
abscissa:  (1)  to  the  extreme  left  is  the  over-all  level 
of  noise  in  the  octave  bands  centered  at  125,  25C,  -  -  -, 
4000  cps;  (2)  in  the  center  is  the  over-all  level  of  noise 
in  the  octave  bands  centered  at  500,  1000,  and  2009 
cps;  (3)  and  at  the  right  is  the  over-all  level  of  noise 
in  the  bandwidth  from  600  to  4SG0  cps  (or  for  octaves 
centered  at  S50, 1700,  and  3400  cps). 

It  is  evident  from  Fig.  3  that,  within  the  general 
region  of  AI’s  from  0.2-0.8,  all  noises  except  the 
-r  6-dB/oct  noise  have  the  same  general  slope  of  AI 
score  versus  over-ail  levels  cf  noise,  regardless  of  the 
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Fig.  3-  AI  as  a  ioactka  of  the  oitral!  fcvd  of  noise  for  aosser 
with  ideaEaed  spectrum  slopes  of  —12,  —9,  —6,  tilt,  and  -*-6 
dB/oct  (on  ^xdnoiJerd  loss)  or  i5,  —6,  ±3  dB/oct  (» 
ocUve-imd  basis).  The  ot-o^H  kv&  are  based  on  fcandwidths 
that  in chide  at  the  Idi:  a3  the  octaves  centered  from  125-1000 
cos;  easier;  octaves  centered  at  5001  1000,  and  2000  qis;  and  to 
the  rf;h:  octaves  ha  600-4800  qs  (centered  at  £50. 1700,  and 
3400  cps). 


bandwidth  in  which  the  ovcr-al*  ievri  of  noise  is 
measured.  As  compared  to  the  other  noise  yvectra, 
the  +6-dB/oct  noise  shows  a  sIottst  ircrcas:  in  AI 
with  decreasing  level  of  noise.  Data  to  support  this 
fact  have  been  shown  by  Pickett  and  Krvter  in  Fig.  1 
of  Ref-  2,  where  it  can  be  noted  that,  for  assumed  speech 
levels  of + 10  and  +20  above  reference  speech  level,  the 
ATs  for  the  predominantly  high-frequency  type  of 
noises  (typewriter,  TN  +6,  and  jet)  increase  much  less 
than  for  the  other  noises. 

It  is  also  immediately  evident  from  Fig.  3  that  reduc¬ 
ing  the  bandwidth  in  which  the  noise  is  measured  (from 
octaves  centered  at  125,  250,  -  - -,  4000  cps  to  octaves 
centered  at  500, 1000,  and  2000  cps)  reduces  the  spread 
in  the  noise  levels  for  an  equivalent  AI  value.  It  can 
also  be  seen  that,  if  predominantly  high-frequency-type 
noises  (+6  dB/oct)  are  to  he  included  in  any  predictive 
scheme,  the  bandwidth  centered  on  octaves  at  500, 
1000,  and  2000  cps  is  superior  to  the  older  SIL-cal dila¬ 
tion  bandwidth  of  6O0-4SOO  cps.  Of  course,  since  the 
majority  of  everyday  noises  predominate  in  low- 
frequency  sounds  (are  not  of  the  +6-dB/oct  type), 
the  600-  to  4S30-cps  band  works  well.  The  results  shown 
in  Fig.  3  suggest,  however,  that  the  general  rule  for 
predicting  speech  discrimination  for  speech  masked  by 
noise  should  emphasize  the  rmdfrequcncy  regions 

'  J.  M.  FkkeU  and  K.  D.  Krvter,  “FreSuioa  of  Speech  InteJ- 
Egjbcatv  in  Xccse,"  AF  CaTabndgt  Res.  Ctr.  Tech.  Reps.  55-4 
(Jane  1955). 


(octaves  centered  around  500,  1000,  and  2000  cps)  aDd 
deempharize  frequences  outride  of  this  region. 

The  CFHW  dot-pattem  plot  is  a  method  of  display¬ 
ing  the  perceptually  important  part  of  the  statistical 
aspects  of  speech,  as  formulated  by  French  and  Stein¬ 
berg*  and  by  Bcranek*  from  many  listening  tests.  The 
importance  of  the  30-dB  dynamic  range,  the  200-  to 
6000-cps  bandwidth,  and  the  greater  contribution  to 
speech  intelligibility  of  speech  frequencies  around  2900 
cps  are  amply  displayed  by  the  CFHW  dot-pattern  plot. 
The  properties  of  noise  that  mask  speech  can  be  dis¬ 
played  by  plotting  in  other  forms  data  derived  from 
plots  like  those  in  Figs.  2  and  3. 

Figure  4,  for  example,  is  a  plot  of  the  spectra  of  the 
idealized  noises  at  levels  that  yield  an  AI  of  from  0-00 
to  0.05.  That  is,  noises  with  spectra  of  —12,  —9,  —6, 
fiat,  and  -J-6-dB/oct  slopes  at  the  levels  shown  in  Fig.  4 
will  decrease  the  intelligibility  of  conversation-level 
speech  to  zero.  It  the  over-all  levels  arc  lowered  to  the 
levels  shown  in  Fig.  5,  the  AI  is  increased  to  0.2,  which, 
tor  conversational-level  speech,  would  allow  50%  of 
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Fx-  4.  Scend-pressure  fcvd  in  octave  bands  for  an  A3  between 
OjOO  and  0.5  for  coavegatioaal-fevd  The  parameter  is  the 

slope  of  an  idea  ford  noise  spectrum  of  —12,  —9,  fiat,  and  -5-6 
dB/oct  (Sacred  oa  a  spectnim-kvel  basis  or  —9,  —6-  —3,  -f-3, 
and  4-9  dB/oct  as  plotted  here  oa  an  ociave-kvd  basis)’  The 
over-oil  fere!  of  the  noise  can  be  estimated  bv  addict  03  sB  to 
tie  greatest  octave-band  fevci  of  the  4-9-dB/oc^  1.0  dB  to  the 
greatest  octave-band  fcvd  of  tbs  -6-dB/oct  and  3.4  dB  to  ibe 
greatest  octave-band  level  d  the  ±3-dB/oct  (on  octave-fevel 
basis)  noises-  Calculations  are  based  oa  tbs  Cavanaugh  d  cl* 
dot-patters  display  of  tie  AL  The  SI  SO  csaloor  from  Fig.  1  has 
been  superimposed  for  ease  of  comparison. 


*  X.  R.  French  and  J.  C.  Sternberg.  “Factors  Goverahs:  u 
la'elSriHHty  of  Speech  Sounds,”  J.  Attest.  Soc.  As.  19. 90-U1 
(1947). 

*  I—  L.  EeranA,  "The  Design  of  Speech  Ccmnaatio: 
Systems,"  Itoc.  IRE  35,  SSO-S99  (1947). 
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PB  {phonetically  balanced)  words  to  be  correctly  recog¬ 
nized  (according  to  Kryter,33  Fig.  4).  Similarly,  suc¬ 
cessively  decreasing  the  levels  would  result  in  an  AI  of 
0J5  (Fig.  6),  0.S  (Fig.  7),  and  0.95-1.00  (Fig.  8).  Figure 
9  is  a  composite  of  Figs.  4-8,  with  interpolation,  to  show 
the  maximum  level  of  noises  of  various  monotonic 
spectrum  shapes  that  would  yield  the  designated 
ATs  for  speech  at  conversation  level. 

Because  Fig.  9  is  derived  from  theoretical  and  mono- 
tonic  spectra  muses  does  not  limit  its  application  to 
these  kinds  of  muses.  As  has  been  pointed  out  earlier 
in  tins  series  of  papas,3  to  get  the  best  predictive  value 
out  of  any  set  of  rating  contours,  the  contours  should 
avenge  through  spectra  that  are  irregular  in  shape. 
And,  far  predicting  speech  intelligibility,  this  average 
should  center  on  the  octaves  centered  at  590, 1000,  and 
2000  cps.  A  peaked  noise  spectrum,  like  voice  babble  or 
a  diesel  engine,  should  be  fitted  such  that  the  point  of 
the  peak  overshoots  the  contour  enough  so  that  the 
deviations  of  the  muse  spectrum  from  the  noise-rating 
contour  wifi  average  zero  at  the  frequencies  500,  1000, 
and  2000  cps.  When  used  in  this  manner,  the  AI  con¬ 
tours  in  Fig.  9  can  be  used  for  noises  that  occur  most 
commonly.  And,  of  course,  when  used  in  this  manner  the 
contours  are  actually  bong  used  as  a  graphic  means  of 
finding  a  SIL  based  on  octaves  centered  at  500,  1000, 
and  2000  cps. 
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mancr  m  cycles  kh  xsw 

FiC.  d.  Scord-pressure  level  ia  octave  binds  foe  an  AI  ci  0L5  fee 
anRm&ailkiid  speed.  The  paiatess  are  as  dmiVd  ia 
Fig.  4.  The  SI 60  coatocr  msa  Fig.  1  ins  been  sepoizepoeed  fee 
ease  of  cotapansoe. 

Regardless  cf  rite  fact  that  they  can  be  used  on  most 
noises,  the  shape  of  the  contours  in  Figs.  4-9  are  deter¬ 
mined  by  the  stcctrum  slope  of  the  noses  used  to 
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Fsc.  8-  Scond-jOTSsane  Vrrf  is  octave  toads  fee  as  AI  Ictita 
0.95  xad  3-00  lor  acras&caMnd  y«3i  Toe  pansaeltrs 
xzc  is  dm-Jed  in  rig.  4.  The  NC30  ccalccir  frta  F15. 1  fcisbeea 
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generate  the  data.  Therefore,  it  should  he  considered 
whether  the  — 12-  to  -J-6-dB/oct  slopes  encompass  most 
com  monly-occmring  noises,  and/or  what  happens  to 
the  contours  if  more-  or  less-extreme  slopes  are  chosen. 
The  —12-  to  +6-dB/oct  slopes  certainly  seem  tc  cover 
the  majority  of  naturally  occurring  noises  reported  in 
the  literature,  at  least  those  reported  in  the  references 
in  this  paper.  And,  except  for  specific  cases  ct  anustads 
and  narrow  bands  of  noise,  these  spectrum  slopes  in¬ 
clude  most  laboratory-generated  crises. 

However,  to  get  an  idea  of  what  changes  in  contour 
shape  would  occur  if  mere-extreme  noise  slopes  were 
encountered,  consider  some  of  the  details  in  Fig.  5  in 
which  the  gamut  of  slopes  is  covered.  To  define  the 
Emits:  a  high-frequency  er  high-passed  noise  with  an 
infinite  slope  (vertical  cutoff)  would  pas  vertically 
th rough  a  frequency  of  S00  cps;  an  infinitely  sloped 
low-passed  (Sow-frequency)  noise  would  pas  through 
the  frequency  2500  cps.  That  is,  2&?c  of  the  dots  in  the 
CFHW  dot-pattern  plot  lie  at  or  below  800  cps  or  at 
cr  above  2500  cps.  Slopes  from  roughly  ±20  to  ±40 
dB/oct  center  between  1000  and  1250,  as  do  the  more 
gently  sloped  noises.  So  it  is  not  so  much  the  slope  of 
the  noise  spectrum  but  the  value  of  AI  that  determines 
the  center  frequency.  Of  erase,  ultimately  it  is  the 
characteristics  of  speech  that  account  for  tHc,  not  so 
much  the  speech  spectrum  but  the  width  (actually  the 
narrowness)  of  the  bands  that  cant  ributt  equally  to  the 
intdSpbffity  of  speech  (the  concentration  of  dots  gq 
Kg-  2). 


On  Fig.  9,  it  is  apparent  that  as  the  AI  increases  from 
0  to  1,  the  crossover  (or  minimum  value)  frequency 
shifts  from  about  630  to  about  2500  cps.  This  again  is 
not  dependent  on  the  limiting  slopes  of  the  extreme- 
noise  spectra.  Had  the  theoretical  noises  been  limited 
to  fiat  thermal  noise  and  —  6-dB/oct  thermal  noise, 
the  same  shift  upward  would  have  been  evident.  Note, 
for  example,  where  the  “fiat”  and  “—6”  noises  cross 
on  Figs.  5-7,  representing  ATs  of  0.2,  0.5,  and  0.8: 
namely,  at  about  1000. 1500,  and  2000  cps,  respectively. 

Although  noises  with  slopes  of  30-50  dB/oct  are  not 
common,  noise-induced  bearing  losses  occur  with  such 
extreme  slopes.  And  high-frequency  hearing  losses  limit 
the  nearing  of  low-lcvd  high-frequency  speech  sounds 
in  much  the  same  way  as  a  high-passed  masking  noise. 
In  Fig.  7,  therefore,  a  series  of  3  lines  representing 
noise  spectra  that  would  produce  simulated  hearing 
losses  at  rates  of  from  30  to  50  dB/oct  is  drawn  in  at  the 
position  in  frequency  that  would  correspond  to  an  AI 
of  0-8-  A  curve  representing  a  conservative  estimate 
of  the  sound-pressure  levels  for  the  threshold  of  audi- 
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bill  tv51  is  also  drawn  in  Fi".  7  so  that  the  30-  to  50- 
dB/oct  slope  can  be  roughly  interpreted  in  terms  of 
hearing  level  (Le.,  loss  from  normal  hearing).  To  the 
extent  that  hearing  level  can  be  simulated  by  masking 
noise  cf  comparable  level,  it  can  be  seen  that  the  A I 
scores  of  0.S  can  theoretically  be  achieved  bv  persons 
with  (1)  normal  hearing  up  to  1000  cps  and  simulated 
hearing  losses  of  15  dB  at  2000  cps  ana  2S  dB  at  4000 
cps,  or  (2)  normal  up  to  1500  cps  and  losses  cf  10  dB 
at  2000  cps  and  45  dB  at  4000  cps,  or  (3)  (by  definition) 
normal  up  to  2500  cps  and  an  infinite  loss  above  2500 
cps. 

Also  drawn  in  Figs.  4-S  are  the  SI  or  the  XCA  or  XC 
curves  from  Fig.  1  that  meet  nearly  approximate  the 
given  level  of  AI.  Note,  for  example,  that  the  SIS0  and 
60  correspond  roughly  to  ATs  of  0.0  and  0.5  and  that 
SI70  corresponds  quite  dcsely  to  an  AI  of  0.2.  The 
agreement  of  XCA40  to  an  AI  of  0.S  and  XC30  to  an 
AI1.00  is  not  nearly  as  good  and  it  was  not  intended 
that  they  should  agree.  These  latter  XC  contours  are 
for  rating  the  acceptability  of  offices  where  speech 
intelligibility  is  only  one  aspect  ef  the  total  noise 
environment. 

In  summary,  it  is  quite  evident  that  the  SI  contours 
(derived  from  experimental  results)  and  the  AI  con¬ 
tours  (derived  from  using  idealized  noise- spectra  and  the 
CFHW  display  of  the  AI  formulation)  agree  in  many 
de tails;  the  agreement  would  be  improved  by  inter¬ 
polating  a  few  more  SI  contours.  This  should  :*o*  be  too 
surprising,  inasmuch  as  the  AI  formulation  as  well  as 
the  SI  contours  were  based  os  experimental  listening 
results.  The  degree  of  agreement  could  be  improved  by 
interpolating  a  few  more  contours  in  the  region  between 
the  XC30  and  the  SI60  contours,  by  reexp’oring  the 
statistics  of  occurrence  of  naturally  occurring  noises,  or 
by  essentially  rerunning  the  16  original  Khnnpp  and 
Webster1  noises  at  oZhir  levels  of  speech  and/or  noise 
assd/or  inreUigibflay.  The  agreement  is  not  intended  to 
be  peried  in  any  case  because  the  SI50  and  60  contours 
are  meant  to  be  inierpoiaiions  between  the  XC30/ 
NCA40  curves  ffor  an  ideal  (XC30)  or  nearly  ideal 
(XCA40)  tavironment]  and  the  SI70  contour  known 
to  predict  speech  inidligibiiity  well  at  relatively  high 
(for  offices)  noise  levels.  However,  the  agreement,  as 
fax  as  it  now  goes,  between  the  AI  and  SI  contours  shows 
that  the  AI  iarmuiaiion  (based  solely  on  the  filtering  uf 
speech)  does  hold  within  certain  limitations  when 
applied  to  noise-masked  speech. 

JL  DISCUSSION  OF  THE  ARTICULATION  INDEX 

The  seal  question  is.  “What  are  the  strengths  and 
weaknesses  of  the  AI?”  Kiyteri*  has  shown  that, 
within  the  data  of  any  one  experimenter  (or  any  esse 
group  of  experimenters),  the  AI  very  nicely  takes 

=  J.  C  R-  L5j£5ott,  ia  Ircsaws  <f  Espezmccis I  ibjcfa&xr. 
S.  S.  Sresesa,  Ed.  QtSst  WSey  i  Secs,  lac.  New  York,  1951), 
Oap  23b  p.  995,  Fig- 5. 
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F 33-  10-  Nonsense  syflaUes  and  PB  word  scores  for  given  ATs 
froze  the  data  cf  J.  M-  Pidfcett  and  I-  Pollack  FJ-  Aoxst  Soc. 
An.  30,  955-963  (195S)J  Kmer,«  Miller  {TSvxhoL  BtzIL  2-M, 
1C5-129  (1947)3,  J.  P.  Egan  and  F.  M.  Wiener  fj.  Acoust.  Soc.  Am. 
IS,  435-441  (1946)3,  ar*d  French  and  Stdnlog,*  as  summarized 
by  Emcr.a 

account  of  the  physical  levels  of  filtered  but  otherwise 
unprocessed  speech  and  noise  and  reduces  to  one  param¬ 
eter  (AI)  a  monotonic,  low-variance,  band-limited 
predictor  of  word  (or  syllable,  sentence,  etc)  intel¬ 
ligibility-  However,  he  also  collects  into  one  paper59 
the  data  to  show  that  for  any  given  AI  the  PB  word 
score  or  nonsense-syllable  score  varies  over  a  40% 
range  (see  Fig.  10). 

Earlier,  Licklideri-'  had  also  summarized  the  dis¬ 
crepancies  between  experimental  data  and  AI  pre¬ 
diction;  be  stated,  “When  comparisons  were  made 
between  two  speech  communication  systems  with 
equal  (computed)  articulation  indices,  one  having 
high  spcech-noisc  ratio  and  low  bandwidth  and  the 
other  low  speech-noise  ratio  and  high  bandwidth,  the 
wide-hand  system  usually  turned  in  the  better  measured 
performance.”  Concerning  the  important,  or  center, 
frequency  for  high-  and  low-pas?  'Altered  speech  in  the 
quiet  (which  is  variously  listed  as  being  between  1600 
and  1900  cps  and  is  the  frequency  that  divides  the  AI 
into  two  equal  parts),  Licklidtr  reiterates  (from  the 
data  of  Folia ck51)  what  has  already  been  demonstrated 
ceariy  in  this  paper  and  in  the  Khnnpp  and  Webster 
paper,1  on  which  it  is  based,  that,  “In  the  tests  in  which 
masking  and  filtering  were  combined,  the  high-pass 
and  low-pass  functions  did  not  show  the  kind  of 
symmetry  just  described:  namely,  around  a  certain 

=J.CR. lidaca, "Text*  Aa&lcer Theories," is  Psywic^s: 
.4  Sjiir  tf  a  Sdcczt,  S.  Koch,  EL  (McGiasr-H33  Book  Ox,  lac, 
Xtw  York,  1959).  _ 

—  L  PoSttck,  “Estets  co  High  Pzes  and  Loa  "sss  fDtesisg  oa 
tie  IsutEpicEsr  cs  Speech  ia  Xcese,"  J.  Accost.  Soc.  Am.  2®, 
259-266  (1945)." 
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frequency  of  1600  to  1900  cps.  The  discrepancy  was 
marked.”  The  point  of  symmetry  shifted  successively 
downward  in  frequency  as  the  speech-noise  ratio 
deteriorated.  Concerning  the  relation  between  AI  and 
different  speech  materials,  Licklider  states,  “We  have 
been  assuming  that  the  articulation  score  for  each 
category  of  speech  material  had  its  own  fixed  relation 
to  articulation  index....  But  when  Hirsh,  Reynolds, 
and  Joseph1'41  plotted  their  word  scores  against  their 
nonsense  syllable  scores,  they  found  that  they  had  to 
draw  separate  turves  not  only  for  the  various  categories 
(numbers  of  syllables)  but  also,  within  each  category, 
for  filtering  and  masking.”  Finally,  concerning  the 
relation  between  AI  and  different  spectrum  noises, 
Licklider  quoted  the  results  of  Pickett  and  Kryter7 
to  the  effect  that  for  four  different  noises  the  intel¬ 
ligibility  scores  for  equal  AI  scores  fell  along  four 
different  curves. 

Briefly  resummarized,  the  defidences  of  the  AI  are 
that:  the  frequency'  and  level  bands  are  not  linearly 
additive  fas  Licklider11  states,  “. .  .short  fat  (low 
S/N,  wide  bandwidth)  regions. .  .yield  higher  scores 
than  tall  thin  (high  S/N,  narrow  bandwidth)  regions 
of  equal  area...”];  equal  AI  scores  yield  different 
speech  scores  dependent  upon  (1)  the  types  of  speech 
materials'4  and  (2)  the  spectrum  of  the  noise1-7;  and 
the  “importance  frequency'”  lowers  with  adverse  listen¬ 
ing  conditions.11 

Although  Kryter1*  has  added  some  corrections  to 
minimize  certain  of  the  defidendes  of  (and  to  extend 
greatly  the  usefulness  of)  the  AI,  certain  defidences  are 
still  inherent.  It  is,  therefore,  not  a  condemnation  of 
any  proposed  system  of  estimating  speech  intelligibility* 
that  it  does  not  agree  completely  with  the  AI  calcula¬ 
tion.  In  any  case,  in  the  region  of  0.2  AI  where  the  5170 
(SI  contour  at  the  70-dB  level)  was  based  on  direct 
data,1-*  the  agreement  between  AI  and  SI  is  very  good 
(see  Fig.  5).  Of  course,  part  of  the  agreement  is  because 
the  theoretical  noises  chosen  to  plot  the  AI  contours 
(Figs.  2,  4-9)  encompass  the  spectra  of  the  real  noises 
of  Klumpp  and  Wehster1  upon  which  the  basic  SI 
contour  at  the  70-dB  level  (SI70)  was  developed.  And 
the  relatively  large  discrepandes  between  the  AI  and 
the  51  and/or  NC  or  NCA  curves  at  lower  levels  of 
noise  in  Figs.  6-8  represent  a  deliberate  compromise 
between  the  speech-interfering  aspects  of  the  noise  and 
its  loudness  and  annoyance  aspects: 

HI.  POTENTIAL  USES  FOR  THE  SPEECH- 
INTERFERENCE  CONTOURS 

It  is  too  early  to  determine  the  usefulness  and  limita¬ 
tions  of  the  SI  contours.  A  few  uses  have  been  found. 
For  example,  a  frequency-limiting  network  has  been 

*  L  J.  Hash,  E.  G.  Reynolds,  and  M.  Joseph,  “InteEdbility  of 
Different  Speech  Materials,”  J.  A  coast.  Soc.  Am.  26,  530-559 
(1954). 

K.  D.  Kryter,  “Methods  for  the  Calculation  and  Use  of  the 
Artxolatba  lades,”  J.  A  coast.  Soc.  Am  54,  16S9-1697  (1962). 


built  to  the  inverse  specifications  of  the  SI70  contour. 
When  used  in  a  sound-level  meter  to  measure  the  16 
equally  sireech-interfering  noises  of  Klumpp  and 
Webster,1  it  gave  more-homogeneous  measurements 
than  any  existing  sound-level-meter  filter  network.* 
Or,  restated,  it  appears  to  be  a  good  network  to  insert 
into  a  sound-level  intter  if  the  SI  effects  of  noises  are 
to  be  measured. 

A  penal  and  paper  calculation  has  been  made  to 
determine  if  this  “speech  interference”  filter  used  in 
conjunction  with  the  already  existing  “A”  network  in 
sound-level  meters  might  predict  the  acceptability  of 
the  noise  environment  in  a  potential  office.  This  cal¬ 
culation  used  the  noises  and  rationale  of  Beranek*  in 
the  development  of  the  NC  contours.  In  his  paper, 
Beranek  (Ref.  5,  p.  23)  condudcs  that,  “A  majority  of 
the  [[office]  personnel  object  to  a  noise  whose  ocutve- 
bor.d  levels  at  the  low  frequendes  exceed,  for  a  given 
SIL  [[speech-interference  level,  dedbel  average  of  oc¬ 
taves  in  the  600-  to  4S00-cps  range],  the  values  given  by 
the  NCA  curves,  even  when  the  SIL  is  low. .  .objections 
occur  whenever  the  LL-minus-SIL  difference  exceeds 
30  units.” 

These  observations  are  supported  by  data  in  his 
Fig.  5  (Ref.  5),  which  are  summarized  in  this  paper  in 
Table  I,  columns  1-5.  Column  1  lists  Beranek’s  8  noises. 
In  column  2,  a  B  represents  a  noisy  room  “before” 
sound  treatment;  A  designates  “after.”  Columns  3-5 
are  taken  directly  from  Beranck’s  Fig.  5  and  super¬ 
script  “a’s”  indicate  rooms  judged  unsatisfactory  before 
(LL-SIL>30)  but  satisfactory  after  (LL-SIL<22), 
on  the  basis  of  the  LL-minus-SIL  criteria.  In  columns  6 
and  7,  calculations  based  on  the  spectra  given  by 
Beranek*  are  made  for  A  weighting  (column  6)  and 
SI70  weighting  (column  7).  That  is,  columns  6  and  7 
are  the  best  estimates  of  what  a  sound-level  meter 
using  A  and  5170  weightings  would  have  measured. 
Note  that  in  general  if  the  difference  between  columns  3 
and  4 — namely,  column  5-  »s  equal  to  or  greater  than 
30,  the  difference  between  columns  6  and  nar..rlv, 
column  8 — is  equal  to  or  greater  than  5.  It  would  appear, 
therefore,  that  if  a  sound-level  meter  had  two  weighting 
networks  available,  A  and  SI70,  a  preliminary  judgment 
could  be  made  as  to  the  acceptability  of  the  noise 
environment  of  a  room  by  noting  the  difference  in 
reading  between  the  two. 

Beranek  (Ref.  5,  p.  24)  warns,  “//  is  not  recommended 
!hai  A-scalc  readings  be  used  in  specifications  because 
tkc  same  A-scaiC  reading  may  be  obtained  for  a  sride 
rarieiy  of  shapes  of  spedra.  Furthermore,  the  eight  octare 
bands  are  necessary  in  ike  engineering  design  of  noise 
control  measures  and  no  single  number  can  substitute 
By  using  tire  difference  between  an  A  and  an  SI70  scale, 
some  of  BeranckTs  objections  to  a  single  number  are 
overcome.  And,  since  the  SI70  scale  discriminates 
against  both  low  and  high  frequencies  more  than  the 
A  scale,  some  of  Bcranck’s  objections  re  "...  the 
same. .  .reading. .  .for  a  wide  variety. .  .of  spectra ” 
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can  be  discounted.  Both  high-  and  low-frequency  sounds 
are  judged  to  be  “annoying,”  though  they  are  not 
particularly  “speech  interfering.”  There  can  be  no 
objection  to  Beranek’s  statement  as  to  the  necessity  of  a 
complete  eight-octave  spectrum  for  noise-control  pur¬ 
poses;  but,  for  preliminary  survey  purposes,  potentially 
objectionable  noise  environments  may  be  found  by 
sound-level-mcter  measurements  alone  if  an  SI70  scale 
is  added  (perhaps  to  replace  the  B  scale). 

Beranek  (Ref.  5,  p.  24)  also  points  out  the  obvious 
fact  that  “Complaints  were  also  registered  (room  f) 
[Ref.  5,  Table  I,  noise  f]  when  the  LL-minus-SIL 
difference  was  less  than  30  units  but  the  SIL  [60  >  - 
4800  cps)  exceeded  acceptable  values.”  Noises  b  and  f 
marked  by  superscript  b  are  examples  of  this,  and 
acceptable  improvements  occurred  when  the  SIL  values 
(column  4)  were  substantially  reduced.  The  same 
improvement  could  be  measured  by  substantial  reduc¬ 
tions  in  A  of  SI70  scale  readings  (columns  6  and  7)  or 
by  an  SIL  based  on  the  octaves  300-2400  cps  (column 
9). 

It  is  interesting  tc  note  in  comparing  columns  4  and  9 
(SIL’s  based  on  6QO-4SOC  vs  300-2400  cps)  that  on  the 
average  the  300-  to  2400-cps  SIL’s  are  5.25  dB  greater 
than  the  600-  to  4800-cps  SIL’s,  which  implies  that 
“typical”  office  noises  have  a  spectrum  slope  (when 
measured  in  octave  bands)  of  about  —5  dB/oct  in  the 
speech  range  from  300-4800  cps.  And,  significantly, 
the  acceptable  ones  slope  —4.5  dB/oct  and  the  objec¬ 
tionable  ones,  —6  dB/oct.  This  5-dB  difference  between 
the  300-  to  2400-  and  600-  to  4800-cps  SIL  is  not 
typical  of  all  noises.  The  difference  was  only  1.5  dB  for 
the  36  ship  noises  (Ref.  1,  Table  2)  but  was  2.5  dB 
when  a  weighting  was  added  to  correct  for  how  often 
the  16  noises  probably  occurred  in  a  ships’  environment. 

Other  comparisons  on  Table  I  show  the  SI70  weight¬ 
ing  to  read  on  the  average  1.6  dB  higher  than  the  SIL 
(600-4800  cps)  and  3.6  dB  lower  than  the  SIL  (300- 
2400  cps),  which  makes  it  a  pretty  good  prediction  of 
the  recommended  compromise  SIL  based  on  octaves 
centered  at  500,  1000,  and  2000  o*1-2 

Using  the  data  of  Table  I  as  a  first  approximation  for 
setting  criteria  using  the  A  and  the  SI70  network,  it 
would  appear  that  the  noise  environment  in  an  office 
wifi  be  judged  unsatisfactory  if  the  sound-Jevel-meter 
reading  using  the  SI70  filter  network  is  50  dB  or  above 


Table  I.  Noise  ratings  in  dB  for  Beranek's  [Nois:  Control  3, 
No.  1, 19—27  (1957)]  eight  office  noises.  LL:  loudness  level.  SIL: 
speecn-interference  level.  A,  SI70:  calculated  sound-level-meter 
readings  when  using  an  A  or  an  inverse  SI70  frequency-weighting 
network. 
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•Judged  unsatisfactory  before  and  satisfactory  after  treatment. 
•■Judged  unsatisfactory  (because  SIL  is  too  large). 


or  if  the  difference  between  the  A  reading  and  the  SI70 
reading  is  5  dB  or  above. 

Until  such  times  as  some  data  are  collected,  using 
the  whole  series  of  either  the  AI  contours  or  the  SI 
contours  for  rating  noises,  no  statement  can  be  made 
as  to  their  practical  usefulness.  The  value  of  their  use  in 
better  understanding  the  relationships  between  noise 
spectra  and  levels  and  the  masking  of  speech  has  been 
the  purpose  of  this  presentation. 
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Introduction 

When  is  a  hearing  loss  an  impairment,  a 
handicap,  or  a  disability?  What  is  the  dif¬ 
ference  between  impairment,  handicap,  and 
disability?  When  is  monetary  compensation 
due  if  the  hearing  loss  has  been  caused  by 
military  or  industrial  noise?  These  problems 
were  discussed  at  the  October,  1963,  meet¬ 
ing  of  a  symposium  sponsored  by  the 
National  Research  Council-Armed  Forces 
Committee  on  Hearing  and  Bio  Acoustics 
(  Cl  IABA),  which  was  chaired  by  Hallow-ell 
Davis, 

Many  things  were  discussed,  but  Dr. 
Davis  tried  to  bring  some  order  out  of 
chaos  by  proposing  that  hearing  '’impair¬ 
ment"  be  considered  the  least  noxious  des¬ 
criptor  anil  be  used  to  define  a  “defective 
function.”  He  then  defined  a  hearing  “hand¬ 
icap"  as  being  sufficient  to  “reduce  one’s 
efficiency  in  daily  living"  or  put  one  at  a 
social  disadvantage.  And  he  defined  a  hear- 

Siilunilted  for  publication  Dec  24,  1963. 

United  States  Navy  Electronics  Laboratory. 


ing  “disability”  as  the  most  noxious  and 
sufficient  “to  reduce  one’s  earning  power.” 

Whether  compensation  was  payable  upon 
the  discovery  of  an  “impairment,"  proof 
of  a  “handicap,”  or  only  when  a  job  change 
was  necessary  (“disability”)  was  not  dis¬ 
cussed  directly  since  these  were  medical 
men  and  audiologists,  not  lawyers  and  in¬ 
dustrial  (labor-management)  representa¬ 
tives. 

The  question  of  how  best  to  measure  and 
quantify  the  degree  of  hearing  impairment 
for  speech  was  discussed  loud  and  long 
with  two  jKjints  of  view  being  represented: 
the  present  method,  and  a  proposed  change 
sponsored  by  Dr.  Karl  Kryter. 

Dr.  Davis  traced  the  history  of  the  present 
method  of  calculating  “hearing  impairment 
for  speech”  from  the  average  of  pure  tone 
losses  at  500,  1,000,  and  2,000  cps.  The 
total  history  behind  this  system  dating  back 
to  Dr.  Harvey  Fletcher  and  Dr.  Edmund 
Fowler  will  not  be  belabored  here.  Suffice 
it  to  say  that  the  present  method  was  ar- 
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rive<l  at  by  the  Committee  on  Conservation 
of  Hearing  sponsored  by  the  American 
Academy  of  Ophthalmology  and  Otolaryn¬ 
gology  (hereafter  referred  to  as  AAOO). 
The  AAOO  committee  started  on  the 
premise  that  “Because  of  present  limitations 
in  speech  audiometry,  the  hearing  level  for 
speech  should  be  estimated  from  measure 
ments  made  with  a  pure  tone  audiometer.”  1 

Dr.  Kryter  presented  the  data  from  two 
of  his  recent  papers 2:1  to  support  his 
reasons  for  a  change  in  calculation  method. 
In  his  recent  pajier,8  Kryter  recommended 
“that  the  impairment  due  to  noise-induced 
hearing  loss  for  the  understanding  of  speech 
l>e  estimated  from  the  average  hearing  level 
at  1,000,  2,000,  and  3,000  cps."  Remember 
that  the  present  (AAOO)  method  uses  the 
average  pure-tone  hearing  level  at  500, 
1,000,  and  2,000  cps  as  the  basis  for  deter¬ 
mining  impairment  for  everyday  speech 
under  everyday  listening  conditions,1  for 
all  tv|>cs  of  hearing  losses. 

Kryter’s  Rationale 

The  actual  content  of  Kryter’s  CHABA 
presentation  will  not  be  repeated  here;  it  is 
well  documented  in  two  pajiers.2,3  Only 
his  hv]iotheses  and  rationale  will  l>e  dis¬ 
cussed. 

Kryter’s2  hypothesis  is  that  evidence  for 
the  500-2,000  cps  “fence”  is  based  on  speech 
threshold  tests  (usually  called  Sj>eech  Re¬ 
ception  Tests  and  abbreviated  SRT), 
whereas  it  is  speech  discrimination  tests 
(usually  measured  with  phonetically  bal¬ 
anced  word  lists  and  abbreviated  PB)  which 
are  im|»ortant.  Actually  it  is  neither  PPi 
discrimination  nor  SRT  that  is  the  crux 
of  the  matter,  but,  as  the  AAOO  committee 
states  in  its  report,  “[it  is]  the  ability  to 
hear  sentences  and  repeat  them  in  a  quiet 
environment  [that]  is  taken  as  satisfactory 
evidence  of  correct  hearing  for  everyday 
speech.”  There  is  no  argument  that  speech 
(PB)  discrimination  at  supra-threshold 
levels,  and  especially  in  a  background  of 
noise,  is  a  far  more  valid  test  of  speech 
intelligibility  than  is  speech  threshold,  but 


neither  measures  “everyday  speech”  in 
“everyday  conditions.” 

Kryter  2  stated  that  his  “.  ,  .  recommenda¬ 
tion  [of  a  1,000,  2,000,  and  3,000  cps  fence] 
has  the  support  of  related  studies  [of  Mul¬ 
lins  and  Bangs 4  and  French  and  Stein- 
bergr']  and  is  contested  only  by  those 
experiments  [by  Quiggle,  Glorig,  and 
Summerfield*  and  Harris,  Haines,  and 
Meyers7]  in  which  .  .  .  threshold  .  .  .  tests 
. . .  for  . . .  words  . . .  were  used." 

The  discussion  at  the  CHABA  symposium 
took  four  forms:  criticism  of  Kryter's 
paper,  presentation  of  contrary  evidence,  a 
discussion  of  where  on  the  impairment- 
handicap-disability  scale  the  results  of  all 
studies  fell,  and  how  indeed  does  one  meas¬ 
ure  “handicap”  in  particular. 

This  paper  is  not  intended  to  be  a  com¬ 
plete  transcript  of  the  CHABA  symposium 
so  only  two  aspects  of  the  discussion  will 
be  pursued:  Some  general  criticisms  of 
Kryter’s  exjicriment  and  some  data  of  my 
own  (which  were  also  presented  at  the 
CHABA  symposium). 

Speech  Discrimination  vs 
Speech  Threshold 

One  of  Kryter’s  major  hypotheses  is  that 
PB  tests  are  more  typical  of,  or  at  least 
better  predictors  of  “everyday  speech  in 
everyday  environment”  than  are  SRT  tests. 
In  a  factorial  analysis  study  of  speech 
perception,  Hanley  8  did  indeed  find  separate 
factors  for  thresholds  (including  500,  1,000, 
2,000,  and  4,000  cps  tones  and  sentences, 
spondee,  nonsense  syllable  and  PB  speech) 
and  for  "resistance  to  distortion”  and  for 
“resistance  to  masking.”  Both  of  Han¬ 
ley’s  “resistance  to-”  factors  included  tests 
which  made  "normal  ears”  hear  speech 
as  it  might  be  heard  by  partially  deafened 
individuals.  Solomon,  Webster,  and  Curtis  0 
also  found  separate  speech  threshold  factors, 
a  "distortion  or  masked”  factor,  and  a 
distraction  (selective  attention)  factor. 
Peculiarly  enough,  the  speech  threshold 
factor  included  sentences,  spondees,  and 
nonsense  syllables  but  not  PB  words, 
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whereas  the  masking  factor  included  I’ll 
words,  but  not  sendees  (the  basis  of  SRT 
tests).  Neither  ti  e  Hanley1*  nor  the  Solo¬ 
mon,  Webster,  and  Curtis9  studies  bad 
hard-of-hearing  subjects,  but  the  test  lot¬ 
teries  included  tests  which  tenij»orar«ly  made 
normal  hearing  subjects  hear  as  if  they 
were  partially  deafened  in  one  form  or 
another. 

In  studies  where  both  normal  and  hard- 
of-hearing  subjects  were  used,  somewhat 
different  results  obtain.  Ross.  Huntington, 
Newby,  and  Dixon  10  studied  the  relation¬ 
ships  between  pure-lone  audiometric  meas¬ 
ures  and  PI?  scores  for  groups  of  normal 
and  sensorineural  subjects.  They  included 
tests  of  difference  limen  (DI.)  tor  intensity 
and  frequency  but  found  that  “.  .  .  the  only 
factor  which  appeared  to  be  related  tc  speech 
intelligibility  was  the  extent  or  configuration 
of  the  hearing  loss.”  They  further  added 
that  “the  subjects  with  high-frequency  hear¬ 
ing  losses  demonstrated  less  relative  effect  of 
noise  u*x>n  their  discrimination  scores  than 
did  subjects  with  flatter  pure-tone  threshold 
configurations.” 

Mullins  and  Bangs 4  correlated  some 
measure  of  the  hearing  levels  at  5CG.  1,000, 
and  2,000  cps  to  PB  scores  and  found  a 
high  correlation.  Unfortunately,  they  did  not 
correlate  PB  scores  with  the  masking  index 
at  1,000.  2,000,  and  3,00 0  cps.  This  S3me 
study  did  show  the  correlation  between  PB 
scores  and  3,000  cps  to  be  numerically 
slightly  greater  than  that  between  PB  and 
2.00G  cps.  and  for  this  reason  this  study  is 
quo-v,:  by  Krytcr  as  supporting  the  1,000. 
2,0l3.  3.000  cps  fer.-.e. 

Ncither  of  these  studies  in  them -elves  sup¬ 
port  either  the  500.  1,000,  and  2,000  cps  or 
the  1,000,  2.000,  and  3.000  cps  fence  s’mee 
within  any  single  set  of  data  the  two  fences 
were  not  compared  to  each  other. 

Elliott 11  correlated  PB  scores  to  many 
other  audiometrir  measures  including  SRT 
scores  and  the  degree  of  the  difference  in 
hearing  level  at  adjacent  pure-tone  fre¬ 
quencies  and  found  that  for  two  of  her  three 
samples  “. . .  SRT . . .  was  the  best  single  pre¬ 


diction  of  I’ll  score. . . .”  In  the  third  sample 
*\  .  .  PB  score  in  the  non-test  ear  .  .  was 
best.  Although  Elliott  did  not  correlate  aver¬ 
age  loss  at  500.  1.000,  and  2,000  cps  or  at 
1,000,  2.000.  and  3.000  cps  with  I*B  score  in 
her  original  rejiort,  she  has  subsequently 
done  so.12  She  finds  that,  “Both  averages  cor¬ 
relate  negatively  with  PB  scores  —  (and)  — 
this  negative  correlation  is  significantly  dif¬ 
ferent  from  zero.  .  .  ."  She  further  finds 
Shat,  “Prediction  of  PB  scores  for  listeners 
with  normal  hearing  is  not  significantly  dif¬ 
ferent  from  zero  for  either  average.”  In 
general  there  was  no  significant  difference 
lietween  the  coefficients  of  PB  score  vs  500. 
1,000,  and  2,000  cps  and  PB  score  vs  1,000, 
2,000,  and  3,000  cps.  For  one  of  her  ten 
groups,  however,  a  difference  between  0.66 
(PB  vs  500.  1,000.  and  2,000)  and  -0.78 
(PB  vs  1,000.  2,000,  and  3,090)  was  sig¬ 
nificant  at  the  0.05  level  of  confidence.  There 
is  here,  therefore,  a  thread  of  evidence  fa¬ 
voring  the  higher  frequency  fence. 

Critique  of  Kryter's  Data 

The  Kryter.  Williams  and  Green3  paper 
on  which  Kryter 2  liases  his  recommenda¬ 
tions  does  make  comparable  measures.  How¬ 
ever,  there  is  one  potentially  weak  link  in 
that  paper3  which  throws  an  element  of 
doubt  into  the  generality  of  the  conclusions. 

The  potential  weak  point  in  the  Kryter. 
Williams,  and  Green  3  paper  has  to  do  with 
the  correlation  technique  used  in  arriving  at 
the  conclusions.  A  correlation  coefficient  de¬ 
pends  critically  on  the  range  of  values  stud¬ 
ied.  It  is  apparent  from  Fig  1  of  Kryter 
ct  a! 1  ( which  is  reproduced  as  lug  1  in 
Kryter3)  that  both  the  range  of  hearing 
losses  and  the  number  of  cases  with  large 
hearing  losses  are  less  at  frequencies  lieiow 
2,000  cps  than  for  frequencies  above  2,000 
q>s.  For  this  reason  alone  correlation  co¬ 
efficients  between  anything  and  hearing  lev¬ 
els  of  3,000  cps  and  above  will  be  numerically 
larger  than  correlations  of  the  same  thing 
wit];  hearing  levels  of  2,000  cps  and  below, 
since  the  magnitude  of  a  product-moment 
correlation  coefficient  reflects  to  a  great  cx- 
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Fi>;  1  — Masked  audio¬ 
grams  due  to  low  frequen¬ 
cy  noi=c  spectra  equated 
to  !«*  equally  speech  in- 
UTieriin;  with  flat  spectra 
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tent  the  range  and  number  of  cases  distrib¬ 
uted  throughout  the  range  of  measurement. 
Tbi>  same  reasoning  could  explain  the  one 
case  out  of  ten  in  HHiott's  ,=  <lata  where  the 
1,000.  2.0G0.  3.000  cps  correlation  with  1*15 
score  was  significantly  greater  than  the  cor- 
rcsjxnjding  500.  1.000.  and  2.000  q>>  cor- 
relatiou. 

Krytcr  et  ai 1  are  aware  of  this  artifact 
and  they  j«nnt  it  out  in  comparing  their  data 
to  those  of  Harris  et  al,7  who  gel  much 
higher  correlations  between  <]>eech  thresh¬ 
old  tots  and  the  frequencies  500.  1,000,  anti 
2,000  cps  than  the  frequencies  3,000.  -1.000, 
and  6.000  cps.  Krytcr  cl  al 5  say.  “These 
differences  in  results  could  be  at  least  partly 
explained  on  the  basis  of  differences  among 
subjects  .  .  .  for  example  some  of  the  sub¬ 
jects  «»t  Harris  et  al 7  suffered  hearing  losses 
that  were  more  severe  than  any  in  our  group 
of  listeners.'’ 

I  f  Krytcr's  recommendation  is  confined  to 
noise-induced  hearing  hiss  subjects  and  if 
the  sample  he  portrays  tin  log  1  of  ref  2) 


is  typical  of  noise-induced  hearing  losses, 
then  the  criticism  is  not  jarticularlv  valid. 
His  method,  as  he  shows,  is  optimal  for  the 
sampl*  he  studied.  But  since  the  methyl  of 
tiie  AAOO's  Committee  on  Conservation  of 
Hearing  is  to  encomjass  all  types  and  shapes 
of  hearing  loss  due  to  any  cause,  the  <Iata  of 
Kryter  et  al3  should  be  generalized  with 
caution. 

Although  the  Kryter  et  al 3  sample  may 
reflect  the  distribution  of  noise-induced 
hearing  loss,  it  does  not  necessarily  represent 
the  distribution  of  all  kinds  of  hearing  loss. 
As  a  general  sample  from  which  Jo  draw  gen¬ 
eral  conclusions,  it  is  not  well  distributed: 
There  are  essentially  four  groups:  losses  of 
30  db  at  all  frequencies  (1)  of  500  cps  and 
above.  (2)  of  2.000  cps  and  above.  # 3 »  of 
3,000  cps  and  above,  and  (4)  of  4,000  cps 
and  above  (or  out  of  the  range  of  interest). 
There  is  a  gaping  hole  in  that  no  group  quali¬ 
fies  with  losses  at  1.000  cps  and  above.  If 
such  a  group  existed,  the  groups  would  be 
symmetrical  around  the  center  of  the  dis- 
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putc,  namely,  1,000  and  2,00rk  Thai  averag¬ 
ing  in  either  500  or  3,000  would  give  some 
positive  answer  as  to  whether  losses  at  500, 
1,000,  and  2,000  q>>  or  at  1.000.  2,000.  anti 
3,000  cps  would  best  predict  hearing  impair¬ 
ment  for  speech. 

If  Krvter's 2  basic  argument  is  sound, 
that  speech  discrimination  is  more  j>crtinent 
than  speech  threshold  in  determining  a  hear¬ 
ing  impairment  for  sjieech.  it  should  follow 
that  the  higher  frequencies  are  the  more  im- 
jiortant  ones.  But  does  the  dropping  of  500 
qis  for  3,000  cps  really  help  in  predicting 
speech  discrimination  loss?  Xote  the  last  col¬ 
umn  of  Kryter's  Table  I  2:  There  are  essen¬ 
tially  two  groups  of  scores.  54  and  above, 
and  40  and  below.  The  big  break  in  the  con¬ 
tinuum  of  speech  discrimination  comes  be¬ 
tween  those  groupings.  The  hearing  level 
discontinuity  that  dirides  those  groups  is  a: 
2.000  cps.  Both  the  500.  1.000.  and  2.000 
cps  and  the  1 ,000,  2.000.  3.000  cps  averages 
show  large  changes  at  this  breakover  jioint. 

This  same  phenomenon  is  evident  in 
Kryter’s  Mg  l.2  To  make  a  good  bipartite 
prediction  from  graphical  data  it  is  desirable 
to  have  two  horizontal  lines  at  different  lev¬ 
els  ( one  corresponding  to  acceptable  and  one 
nonacceptable)  connected  in  as  small  a  hori¬ 
zontal  spacing  as  possible  by  a  steeply 
sloped  line.  The  choice  jwint  is  essentially 
the  midpoint  on  the  sk>|«ed  line,  and  the 
steeper  the  slope  the  more  clear-cut  or  pre¬ 
cise  is  the  prediction,  or  *  decision.  On  Kry¬ 
ter’s  Mg  1  2  it  is  evident  that  choosing  a  5 
db  fence  at  500,  1.000.  and  2.000  q»s  gives 
exactly  the  same  prediction  as  choosing  a  15 
db  fence  at  1 ,000. 2.000.  and  3.000  cps.  And. 
in  fact.  Kryter  2  makes  this  exact  same  state¬ 
ment  when  he  writes.  .  .  maintaining  the 
15  db  fence  and  taking  hearing  level  as  an 
average  of  pure-tone  audiograms  at  1.000, 
2.000.  and  3.000  cps  is  roughly  equivalent, 
keeping  speech  intelligibility  constant,  to 
measuring  the  hearing  level  at  500.  1.000. 
and  2.000  cps  and  lowering  the  fence  by  10 
db.  that  is,  a  fence  of  5  db  ” 

Based  on  the  data  he  shows.2  his  rccon:- 
mendation  could  therefore  have  equally  well 
been:  maintain  the  500.  1.000.  and  2.000  cps 


average  but  lower  the  fence  from  15  db  to 
5  db.  which  i-.  in  fact,  stated  as  the  fourth, 
or  last,  conclusion  on  his  jnj>er.-  But  it 
should  lie  {minted  out  again  that  no  objec¬ 
tive  criterion  has  been  mentioned,  suggested, 
or  quoted  to  convert  Kryter’s  jxrcentagc  of 
sentences — or  words — correct  to  any  location 
o.i  the  imjciirnient-handicap-disability  scale. 
Until  someone,  or  some  committee,  cui  con¬ 
vert  j»er  cent  ,*B’s.  sentences,  or  any  other 
objective  sjievch  lest  to  degree  of  imjiair- 
ment-handicap-disabiiity.  it  is  jioinlless  to 
argue  whether  the  hequatcies  now  used  to 
predict  sj  >eech  threshold  from  pure  tone 
audiometer  should  lie  changed.  Kryter’s  own 
<lata  show  that  in  actuality  either  set  of  fre¬ 
quencies  are  aiiout  equally  good,  dejx-nding 
upon  what  decibel  level  is  used.  Any  t*nn- 
bination  of  frequenci-s  and  db  levels  (or 
losses)  converts  only  to  per  can  of  sonx- 
type  of  s}vcch  correct  ami  not  to  am  degree 
of  iropairmait-hamlicaji-disability. 

A  real  question  still  remains  to  U-  an¬ 
swered:  When  «k>es  a  hearing  im|«:n:x-nt 
for  sjieech  become  a  real  social  impairment  ? 
Beasley 22  asked  people  whether  they  consid¬ 
ered  themselves  to  hear  normally,  to  lie 
slightly  impaired,  etc.  ami  then  administered 
jiure-tonc  audiometer  tests.  He  found  that 
his  “minimally  impaired  group"  averaged 
hearing  levels  of  just  over  20  db  for  500. 
1.000.  ami  2.090  q«s  ami  only  jus;  slightly 
more  than  that  for  1.000.  2.G09.  and  3.000 
cps.  On  further  questioning  he  found  his 
minimally  imjsiired  group  had  difficulty  ". . . 
in  church,  at  the  theater  or  in  group  conver¬ 
sation.  but  .  .  .  not  ...  at  dose  range.  .  .  .’’ 
Is  Beasley’s  population  is  typical  of  today's 
pojnabtiorj.  j>crha}»s  a  15  db  fence  is  t*»o  low 
for  a  1.000.  2.000.  and  3.000  q»s  average, 
ami  nx..-J  certainly  a  5  db  fence  at  500.  1 .000. 
and  2.000  q»s  vvoubl  be  too  low. 

If  Kryter’s2  recommcmiation  of  a  15  dh 
fence  at  1.000.  2.000.  and  3.009  ct»s  fie  con¬ 
sidered  for  adoption  it  should  be  confined  to 
noise-induced  hearing  los>  cnse<  alone 
« which  is  what  he  recommended  >  and  it 
should  be  kq»t  in  miml  that  “hearing  im¬ 
pairment  for  speech"  may  be  defined  at  ;<»> 
low  a  level  (a  man  may  tie  clarified  a-  King 
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"hearing  imjoired  for  sjieech"  who  really, 
in  on:  noisy  world,  is  no!  socially  "impaired” 
or  "handicapped”). 

Continuing  on  the  Bea>!ey-ly|»c  study.  Dr. 
l.eu  Doerfler  and  Dr.  Grant  Fairbanks  re¬ 
turned  some  preliminary  works  of  their  own 
at  the  CHABA  symposium  on  social  handi¬ 
caps  as  related  to  some  measure  of  hearing 
loss.  This  is  where  much  effort  is  needed  at 
the  present  lime. 

At  this  point  in  the  CHABA  symposium 
the  present  author  was  invited  to  present 
<lata  showing  that  if  speech  is  masked  by  all 
varieties  of  noises  ( -imilar  in  many  respects 
to  sjieech  being  hearc  by  jieojde  with  all  types 
of  audiograms  j  it  is  tl»e  noise  in  the  octaves 
centered  at  500.  1.000.  and  2.000  that 
l»esi  describes  the  discrimination  loss  for 
sjieech.  The  gist  of  my  talk  follows. 

Important  Frequencies  for 
Speech  Perception 

If  consideration  is  given  to  generalizing 
Kn1crs:  higher-frequency  hearing  imjxair- 
uient  for  s|<eech  criteria  to  all  types  of  hear¬ 
ing  loss  cases  ami  in  particular  if  a  criterion 
for  calculating  hearing  impairment  for 
sjieech  is  dedred  that  can  be  generalized 
over  into  the  world  of  normal  hearing  peo¬ 
ple  in  |iatholcgkal  t  noisy  *  environments, 
then  cunsnleration  should  be  given  to  other 
tyjies  of  evidence. 

Klumpp  and  Wdwcr.  in  a  recent  serir-.- 
of  {?aj>ers.,<17  have  <kme  essentially  the  cra¬ 
ter -e  of  what  Krytcr  et  ai3  have  <k»ne. 
Whereas  Kryter  et  a! 3  gave  qwdi  discrimi- 
nai!<ei  !e3?  to  subjects  exhibiting  carious 
limited  kimSs  ami  «kgrccs  of  hearing  loss 
'lectta.  Klumpp  and  Webster 13  gave  sj*eech 
discrimination  tests  to  normal  hearing  sub- 
i«t'  in  16  diverse  kinds  of  noise.  In  ielh 
case'  subjects  heard  st»eech  with  {art  of  its 
']«cctnKii  gone,  cither  because  of  bearing  loss 
«»r  by  Ixriitg  snasked  by  mdse  la  rough  simu- 
lalira  of  a  sensorineural  hearing  loss). 

The  Kiytcr  et  al3  jmocedurc  is  more  di¬ 
rectly  applicable  to  hearing-impaired  jxqvj- 
lation'  but  :s  limited  in  that  equal  numbers 
of  oases  of  all  types  of  hearing  losses  were 


4» 

not  included.  The  Klump]>  ami  \\  ebster  ,' 
proccilure  is  only  intuitively  related  to  Isear- 
ing-imj«aircd  jsojiulations  but  does  inelu<!e  a 
much  wilier  distribution  of  masking  sjnetra 
( j«otemia!  simulate*!  hearing  losses  of  all 
tyjies).  Statistic:*!  analyses  on  the  Krvter 
cl  al  3  «lata  describe  characteristics  of  noise- 
induced  hearing  loss  as  measured  by  sjieech 
discrimination  tests.  Analyses  of  the  Klumpji 
and  Webster 15  «iata  describe  characteristics 
of  speech  discrimination  as  influenced  by 
masking  noises  t  simulated  hearing  b»sse.-*. 
There  should  be.  and  indeed  there  are.  many 
similarities  between  the  results  of  the  two 
baste  sets  of  <iata.  Both  study  the  discrimina¬ 
tion  of  s; »td rum-limited  speech. 

in  the  Klumjiji  and  Webster  exj«iawi;.!1 
each  of  eight  normal-hearing  subjects  lis¬ 
tened  in  turn  to  Rhyme  iK  word  lists  in  shr 
presence  of  16  diverse  noise  spectra.  The 
procedure  was  to  adjust  the  ievd  of  the  16 
noises  until  each  noise  reduced  the  j»ercent- 
age  of  correctly  heard  words  to  50G .  ‘’‘he 
words  were  jirciecurded  ami  jined  lack  at 
a  constant  levd  i  of  7S  <lb  at  i  meter  frrai 
the  loudsjscaker  or  at  the  ear  of  the  lis¬ 
tener  I.  The  *ie!ails  of  this  cxjenmai!  arc 
describe*!  in  a  jGj«er  by  Klumpp  ami  Wci»- 
ster.33  inn  suftke  it  to  add  here  that  Rhvrof 
words  are  a  series  of  50  monosyllabic  wor«is 
like:  lot.  can  .  .  .  get.  Ead:  word  has  five 
alternatives  t  such  as  cot.  tan.  .  .  .  let:  go', 
fan.  ...  bet.  etc)  so  that  on  the  answer 
sheets  *  -ot,  -an. . ..  -tt)  only  the  initial  con¬ 
sonant  need  be  supplied.  Most  of  the  16 
noises  were  quass-steady-state  anti  the  sjev- 
tra  varied  faun  a  Inv-irojim}’  runildc  and 
generator  hum.  to  jet  aircraft  noises  indud- 
ing  hsgh -frequency  whines.  There  a 
iyj'ewriter  sound  and  a  muhivoice  hablde. 
i-abompiry-gcncnatcd  thermal  noises  of  flat 
sjectruni  and  ot  an  increasing  6  db  jwt  ic 
tavc  qef.nan  i  profominantly  high  fre¬ 
quency  i  ami  of  a  -kcreasing  6  «fl»  j»cr  octave 
were  also  used  These  noises,  unlike  clinkal 
hearing  levels,  simulated  as  many  low- 
frequency  or  high-frequency  losses  as  Sat 
losses. 

The  jnsqmse  of  the  original  papers  1,15 
was  to  find  simple  means  of  measuring  the 
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16  noises  which  had  already  been  adjusted 
in  level  to  be  equally  sj>eech-interfering. 
The  analogous  problem  in  clinical  audiology 
would  be  to  find  the  common  hearing  level 
pattern  of  a  group  of  cases  all  of  wliem 
scored  50%  on  a  word  list  presented  at  the 
same  level  to  each  of  them. 

In  the  first  published  version  of  these  re¬ 
sults,5*  it  was  found  that  the  best  simple 
measure  of  predicting  the  speech  interfer¬ 
ence  of  these  noises  was  to  find  the  level  of 
noise,  measured  in  db,  in  the  octaves  300  to 
600, 600  to  1,200,  and  1.200  to  2.400  cps  md 
then  average  these  three  levels  (roughly 
analogous  to  finding  the  average  hearing 
level  at  425,  850,  and  1  JOG  cps).  A  subse¬ 
quent  paper 15  found  that  an  average  of  the 
noise  levels,  in  db.  of  octaves  centered  at  500, 
1.000.  and  2.000  cps  was  nearly  equivalent 
to  the  slightly  lower  octaves  of  425, 850.  and 
U00  cps.1*  In  both  cases  these  results  were 
better  than  average  levels  for  octaves  cen¬ 
tered  at  S50,  I JOO.  and  3.400;  or  at  1.000, 
2.000.  and  4.000  cps. 

likewise,  it  was  found  that  the  one  single 
octave  that  predicted  better  than  any  other 
single  octave  was  the  octave  centered  at  850 
ter  1.000)  cps,  followed  in  order  by  425  (or 
500*  cps.  iJ00  (or  2.000}  cps.  and  finally 
3.403  for  4.000)  cps. 

To  the  extent  that  the  level  of  a  masking 
noise  in  a  given  region  corresponds  to  a 
hearing  less  in  the  same  region,  the  impor¬ 
tant  frequencies  are  in  order;  I  .COO.  500. 
2.0G0.  and  4JOGO  cps.  and  the  average  of 
500.  t.GOO.  and  2JQQQ  cps  is  superior  to  the 
average  of  1.000. 2.000.  ar«i  4.000  cps. 

The  Importance  Frequency  in 
Speech  Perception 

The  center,  mid,  or  mean  frequency  of 
Kryters2  1.000.  2.000.  3,000  cps  scheme  is 
1JSI7  cps  (cube  root  of  the  product »  whereas 
the  mean  frequency  of  the  Committee  on 
Conservation  of  Hearing  scheme  is  1,000 
cps.  If  this  mid  frequency  be  called  the  most 
important  frequency  in  speech  perception, 
there  is  a  large  amount  of  literature  on  the 
subject. 


French  and  Steinberg.2  using  sj>cech  in 
tlte  quiet  and  normal  listeners,  progressively 
high-  and  later  low-pass-fihered  the  sjsecch 
until  it  became  progressively  less  intdligible. 
This  was  a  discrimination,  not  a  threshold 
test.  They  found  ihat  speech  was  equaily 
deteriorated  when  all  frequencies  either 
above  or  belon  1,900  cps  were  filtered  cut. 
Or  the  frequency  range  above  1,900  cps  was 
as  important  as  the  frequency  rznge  bekr.v 
1,900  cps.  Iieranek,1*  using  male  voices  only, 
found  the  crossover  frequency  to  be  1.660 
cps  under  the  same  quiet-filtered  -sj»eech 
conditions. 

Pollack  *•  redid  the  filtered  speech  intelli¬ 
gibility  studies  but  added  a  broad-hand  noise 
background  and  varied  the  level  of  the 
speech.  He  found  that  the  crossover  <cr 
equal  importance)  frequency  increased  from 
£00  q*s  for  low  levels  of  speech  through 
1 .010, 1,300. 1,430,  to  1.630  cps  for  increases 
of  10  db  in  the  speech  kvd. 

Pyer.”  doing  the  reverse  of  Pollack 
(namely,  filtering  the  noise  around  a  broad 
band  speech  signal),  found,  like  Pollack,  that 
as  the  speech-to-iMHse  differentia!  increased, 
the  crossover  frequency  increased  from 
about  1.000  cps  to  almost  2.000  cps. 

A  crossover  frequency  has  been  derived 
from  the  Khnnpp  and  Webster 12  data  and 
published  along  with  details  of  its  dcriva- 
lion.5*  The  data  in  condensed  form  are 
shown  in  Fig  1-4.  Fig  1  shows  the  masked 
audiograms  dre  to  the  predominantly  low  fre¬ 
quency  noises  from  Webster  and  Klumpp.1* 
The  masked  audiograms  are  plotted  as  dif¬ 
ferences  from  the  masked  audiogram  due 
to  the  thermal  or  fiat  noise,  but  the  relative 
levels  and  slopes  are  representative  of  the 
noises  as  adjusted  to  be  equally  speech- 
interfering-  Fig  2  and  3  show  similar  plots 
(again  related  to  the  fiat  noise  masked  audio¬ 
gram)  of  the  slightly  low’  frequency  ami 
nearly  fiat  spectra  noises.  The  area  enclosed 
by  the  spread  o:  the  noise  audiograms  in  Fig 
1.  2.  ar.d  3  is  plotted  in  Fig  4  at  the  top. 
At  the  bottom  of  Fig  4  are  the  average 
masked  audiograms  of  Fig  !.  2.  and  3  to¬ 
gether  with  the  one  predominantly  high  fre¬ 
quency  noise-masked  audiogram.  Observe  h? 
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Fig  4.  —  Sqicnuiiiici! 
of  mi'ifti  asjdk^rsini 
«3z»e  to  ijw  trz  of  equally 
<-j<nrh  nScrirri.".; 
■itmic  tTKiwtr  or  fm- 
;*csanrr  freqatnry 


Fi«  4  I  ha;  bush  the  preokicniuantly  h^a-  ami 
knr-?ittjatacT  not?e  amSograras  ami  she 
sissalr  irigb-  ami  km-ireqacacT  noise  a»&>- 
^ass  cross  each  ocher  ai  aboot  S50  cj»f. 

Kryter  —  at?  ha-  «snc  ■data  of  jjetdi 
masked  J>y  acee  which  rboa  a  km-  cro>v 
ocer  fremx ncy.  If  the  information  in  his 
Fig;  2  ami  3  of  net  22  were  mcncnhened  ««• 
■sburar  the  Itwt  of  she  ff«e  SfuKlra  fin  !i” 
2.  ref  22i  stotoi  to  yield  equally  jjvjtdi- 
iaierfeii^  races  of  Ip?  <  <  irom  Fi»  3.  res 
22 1  she  !»»  Se«Jv  fJogiei  aaifts  rrse- 
teiaaiir  high-  ami  kw-froptney  notrs  I 
wsaid  cross  at  about  900  cps  and  she  nn*e 
gently  sloped  h^j-  ami  Fwr  frrqtKixr  noise> 
at  I.4(Wc|ts. 

Is  appear?  therefore  from  she  evidence  of 
PeSack.14  Dyer.15  am!  Kr,;cr.=  ami  U‘ci>- 
ster  ami  Khsnpp !c  that  noise- masked  ssecch 
has  a  oossner  or  importance  frequency  a> 
much  as  an  octave  sower  Shan  she  crossover 
frequencies  of  altered  speech  in  the  quiet 
r  French  ami  Nadsrtg.3  ami  iJoaacs!Si. 
IV«h  3  VtJiack  ami  Dyer13  shew  that  the 


!re|Bcny  varies  irun  S00  or  1-000  cps  to 
I  fO 0  tn  ZPOO  cps  as  she  S{eerfi-5o-aoisc  dif- 
tcrential  increase- . 

Tbese  avowal  anomalies  kd  to  further 
work  on  speech  interference  criteria.  In  a 
jojer  by  Webster,*  the  articulation  index 
i All  developed  In*  French  ami  Sternberg' 
am!  its  ajediration  in  noises  of  various  !ev- 
eis  Sfti  to  a  set  of  contours  which  seems  to 
explain  what  hapjscns  when  fecaire;  to 
speech  in  increasing  levels  os  noise  lor  in¬ 
creasing  hearing  losses).  Fig  5  shows  that  as 
the  level  of  noise  in  an  octave  band  increases 
<  analogous  Jo  increasing  hearing  losses ).  the 
minimum  point  of  the  contour  tlrat  traces  out 
any  one  given  ankuiation  index  i  A 1 1  level 
•iccneascs.  in  frequency.  In  simple  terms,  as 
the  listening  condition?  get  worse  «roore 
noise  or  presumably,  greater  hearing  losses), 
the  importance  frequency  gets  sower. 

In  terms  of  the  original  argument  of 
whether  500.  1.000.  and  2.GG0  q»s  midfre- 
qucncc  l.COO  q»>;  or  1.000.  2.000.  ami  3.000 
q*i.  nrnlfrequcncy  ii?I7  cps.  be  used  as  a 
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pure-tone  losses  to  predict  ability  to  hear 
speech  until  a  point  is  found  that  de¬ 
fines  some  "compensable,"  or  otherwise 
adequately  defined  "handicap"  in  the  im¬ 
pairment-handicap-disability  scale.  If  the  cri¬ 
terion  point  is  close  to  the  "impairment" 
end  of  the  scale,  then  perhaps  a  center  fre¬ 
quency  around  1,800  cps  is  the  best  pre¬ 
dictor.  If,  however,  the  criterion  point  is 
defined  more  in  the  center  or  toward  the  dis¬ 
ability  end  of  the  scale,  a  center  frequency 
at  or  below  1,000  cps  would  be  the  best 
predictor. 

John  C.  Webster,  PhD,  US  Navy  Electronics 
Laboratory,  San  Diego,  Calif  92152. 
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Speech  Communications  as  Limited  by  Ambient  Noise 


J.  C.  WEBSTER 

l'.  S.  Navy  Mrclrnrics  Lahoralory,  San  Dir^n,  California 
(Keccived  17  August  1964) 


SiKrcch-intcIlisibilily  scores  as  a  function  of  noise  level  are  studied  for  face-to-face,  sound-powered-phone, 
and  amplified  speech-  (earphone  and  loudsjicakcr)  communication  conditions.  The  speech-interference  level 
(SIL)  for  octaves  of  noise  centered  at  500, 1000,  and  2000  cps  (0.5/1/2)  is  used  as  the  measure  of  noise  level. 
By  using  this  noise  measure,  much  of  the  work  in  this  field  can  1*  Brought  together  and  interpreted.  It  is 
noted  that  “noisy"  and  “very  noisy"  spaces  are  associated  with  STL's  such  that  “shouting"  or  “very  loud” 
voice  levels  (or  95-dB  s|,ccch  levels)  arc  required  for  conversations  at  1 .5  or  5  ft,  and  this  is  the  region  where 
telephone  conversations  arc  judged  to  lie  “difficult"  or  “unsatisfactory."  All  of  these  adverse  noise  con¬ 
ditions  occur  at  the  region  where  ear  protection  will  aid  intelligibility  and  at  the  boundary  where  car  protec¬ 
tion  should  l>c  used  to  protect  against  hearing  losses.  Where  |>coplc  must  converse  or  communicate  via  some 
interior  communication  device,  0.5/1/2  SIL's  above  70  d»  should  1*  avoided.  At  0.5/1  /2  SIL’s  greater  than 
90  dB,  the  wearing  of  hearing  protection  should  l>c  made  mandatory  and  every  noiseproofing  technique 
(except  a  noise  shield  for  the  microphone)  should  be  employed.  At  0.5/1/2  SIL's  above  100  dB,  every  noise- 
proofing  technique  should  be  employed. 


INTRODUCTION 

VER  the  past  few  years,  several  aspects  of  speech 
intelligibility  in  noise  have  been  studied  at  the 
U.  S.  Navy  Electronics  Laboratory  (NEL).  Many 
speech-communication  equipments  and/or  systems  have 
been  evaluated  for  operation  both  in  the  quiet  and  in 
noise.  Many  of  these  evaluations  have  come  out  as 
NEL  reports1 * * 4"7  or  were  published  in  journals  not 
readily  available.*-*  Others,  especially  those  dealing  with 
the  noise-attenuating  properties  of  ear  protection, ,0-u 


1  J.  C.  Webster  and  R.  G.  Klumpp,  “USNEL  Flight  Deck  Com¬ 
munications  System.  Part  2.  Noise  and  Acoustic  Aspects,”  NEL 
Rept.  923  (29  Nov.  I960),  AD  260  286. 

*  W.  E.  Montague,  “A  Comparison  of  Five  Intelligibility  Tests 
for  Voice  Communication  Systems,"  NEL  Rept.  977  (27  June 
1960),  AD  254  545. 

*  J.  C.  Webster  and  R.  G.  Klumpp,  "Evaluation  of  the  AN/PRC- 
53, ,r  NEL  Rept.  1042  (18  Apr.  1961),  AD  260  294. 

4  J.  C.  Webster,  P.  O.  Thompson,  and  T.  H.  Wells,  “Evaluation 
of  the  AN/PRC-44(XN-1),”  NEL  Rept.  1058  (30  July  1961). 

*  J.  C.  Webster  and  P.  O.  Thompson,  “Noise-Proofed  Sound 
Powered  Phones,”  NEL  Rept.  1073  (25  Oct.  1961). 

*  J.  C.  Webster,  P.  O.  Thompson,  and  H.  R.  Beitscher,  "Intelli¬ 
gibility  of  Amplified  Speech  in  Helicopter  Noise,”  NEL  Rept. 
1080  (7  Nov.  1961). 

7  J.  C.  Webster  and  R.  G.  Klumpp,  “Technical  Evaluation  of 
the  AN/SRC-22  (XN-1)  Flight  Deck  Communication  System,” 
NEL  Rept.  1141  (15  Oct.  1962). 

*J.  C.  Webster  and  P.  O.  Thompson,  “Dynamic  or  Carbon 
Microphone?”  Bur.  Ship  J.  10,  8-9  (14  Sept.  1961). 

*  J.  C.  Webster,  R.  G.  Klumpp,  and  P.  O.  Thompson,  “Capa- 


have  appeared  in  available  journals  but  the  application 
of  the  results  to  communication  systems  need  be  ex¬ 
amined.  In  many  of  these  evaluations,  the  results  were 
comparative  in  nature  (one  system  or  equipment  was 
evaluated  in  terms  of  another)  and  no  attempt  was 
made  to  interpret  the  results  in  absolute  terms. 

Relating  the  results  of  these  evaluations  to  each  other 
and  to  the  work  of  others  has  been  difficult  because  of  the 
diverse  ambient  noises  used  and  the  differences  in  the 
way  both  speech  levels  and  noise  levels  have  been 
measured.  A  recent  series  of  papers17-'5 *  concerning 
methods  of  predicting  speech  interference  provides  a 
framework  that  makes  it  easier  to  generalize  some  of 
the  evaluative  results.  It  is  the  purpose  of  this  paper  to 


bilities  of  Speech  Communication  in  Noise,”  Troc.  Inst.  Environ. 
Sci.,  297-307  (Apr.  1962). 

10  J.  C.  Webster,  “Ear  Defenders:  Measurement  Methods  and 
Comparative  Results,”  Noise  Control  1,  No.  5,  34-42  (1955). 

11  J..C.  Webster  and  E.  R.  Rubin,  “The  Noise  Attenuation  of 
Selective  Ear  Protective  Devices,"  Sound — Its  Uses  and  Control 

1,  No.  5,  34-46  (1962). 

u  R.  G.  Klumpp  and  J.  C.  Webster,  J.  Acoust.  Soc.  Am.  35, 
1328-1338  (1963). 

“J.  C.  Webster  and  R.  G.  Klumpp,  J.  Acoust.  Soc.  Am.  35, 
1339-1344  (1963). 

14  R.  G.  Klumpp  and  J.  L.  Leonard,  “Observer  Variability  in 
Reading  Noise  Levels  with  Meters,”  Sound — Its  Uses  and  Control 

2,  No.  4,  25-29  (1963). 

15  J.  C.  Webster,  “A  Speech  Interference  Noise-Rating  Contour,” 
Acustica  (to  be  published). 
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5  P  E  E  c  11  C  0  M  M  f  X  5  C  A  T  I  O  X  S  !  X  A  M  B  I  E  X  T  NOISE 


summarize  capabilities  of  speech  communication  in 
noise  and  in  some  instances  to  compare  the  results  with 
other  studies  not  directly  concerned  with  equipment 
evaluation. 

I.  OVERVIEW  OF  SPEECH  INTELLIGIBILITY  IN 
NOISE 

The  Rosetta  stone  that  allows  a  comparison  among 
the  XEL  evaluations  and  between  them  and  other 
studies  in  the  general  field  of  speech  intelligibility  in 
noise  is  shown  in  Table  I.  Table  I  is  the  Rosetta  stone 
if  it  is  assumed  that  it  is  the  level  and  spectral  distri¬ 
bution  of  noise  that  most  affect  speech  intelligibility. 
Factors  such  as  the  choice  of  speech  materials,  and  the 
selection  and  training  of  talkers  and  listeners,  preclude 
making  exact  comparisons. 

Table  I  (and  its  predecessors.  Table  H  and  Fig.  IS 
of  Ref.  12,  and  Table  I  and  Fig.  3  of  Ref.  15)  shows 
physical  measures,  such  as  an  A,  a  C,  and  a  proposed 
Sf  (speech  interference)151*  sound-level  meter  reading, 
and  speech  interference  level  (SIL)  measures'7  (at 
600-4800,  300-4800,  and  5(X),  1000,  2000  cps)1*  for 
noises  of  different  spectra  that  are  equated  to  be  equally 
speech-interfering.  In  addition  to  listing  the  levels  of 
different  noises  that  are  equally  speech-interfering, 
Table  I  also  lists  several  equivalent  measures  of  ant- 
single  noise,  ff  the  spectrum  of  an  unknown  noise  re¬ 
sembles  one  of  the  noises  in  Table  I,  any  single  measure 
of  this  noise  can  be  used  to  find  other  measures  and 
thereby  facilitate  comparing  the  results  of  one  study  or 
evaluation  with  others.1* 

With  this  much  explanation  of  Tabie  I,  it  is  time  to 
look  at  the  general  overview  of  the  various  NEL  evalu¬ 
ation  results.  Figure  1,  a  generalized  summary  of  many 
existing  studies,1  shows  the  scope  of  the  problem 
of  talking  and  listening  in  noise.  On  the  ordinate  is 


14  J.  C-  Webster,  J.  Acoust.  Soc.  Am.  36,  1662-1669  (1964). 
The  frequency-dependent  curve  upon  which  the  SI  filter  is  based 
is  shown  in  Fig.  2,  Ref.  15,  or  as  the  SI-70  curve  in  Fig.  1  of  this 
reference.  To  make  the  filter,  the  inverse  of  this  SI-70  curve  is 
used.  The  70  refers  to  the  fact  that  origiona!  curve  was  based  on 
noises  adjusted  in  level  to  be  equally  speech- interfering  at  an  SIL 
level  ol  just  over  70  dB  (see  Table  I  in  this  paper  or  Table  II, 
Ref.  12). 

15  L.  L.  Beranek,  “Aiqilane  Quieting  II — Specifications  of  Ac¬ 
ceptable  Xoise  Levels,”  Trans.  ASME  69,  97-100  (1947). 

14  SIl.’s  are  based  on  arithmetic  averages  of  noise  levels  in  octave 
lands.  The  octave  bands  can  be  specified  by  lower  and  upper 
land  limits,  or  bv  center  frequencies.  The  band  centers  of  octaves 
ranging  from  600  to  4SOO  cps  are  850,  1700,  and  3400.  The  band 
limits  of  octaves  centered  at  500,  1000,  and  2?v"'  cps  are  approxi¬ 
mately  350  to  2800  cps.  Because  of  past  (land  h  i )  usage,  0.6/4.S 
SIL  (not  8.5/17/34)  is  used  to  specify  the  SIL  calculated  from  the 
octaves  600-1200, 1200-2400,  2400-4800  cps.  And  to  nvuntain  the 
simplicity  inherent  in  its  choice,  0.5/ 1/2  SI :  (not  0.35/2.8)  is  used 
for  the  SIL  for  octave  bands  centered  at  500,  1000,  and  2000  cps. 

>*  R.  W.  Young,  J.  Acoust.  Soc.  Am.  36, 289-295  (1964).  Young 
has  compiled  a  Table  similar  to  Table  I,  associating  various  meas¬ 
ures  of  noise  to  idealized  noise  spectra.  Young's  Table  is  not  for 
tqually  speech-interfering  noises,  but  for  noises  equal  on  A 
weighting,  which  for  existing  sound -levd-meler  networks  is  the 
best  for  predicting  speech  interference. 


Table  I.  Levels  in  dB  of  noises  producing  50%  scores  for 
rh\  me  words. 


No. 

Xoise 

Name 

Sound-level 

C  A 

meter 

SI-70* 

S|>eech- 

0.6/-I.8' 

interference  levels 
0.3/4.8*  0.5/l/2J 

1 

Rumble  (  — 12) 

105 

86 

83 

66 

71 

72 

3 

Blower  (-9) 

92 

85 

SS 

6; 

72 

74 

4 

TN  -6 

87 

79 

8! 

69 

72 

73 

... 

Helicopter* 

(82) 

(82) 

(SO) 

(73) 

(72) 

(74) 

9 

Babble 

81 

82 

80 

75 

73 

f  .» 

10 

TN  fiat 

8; 

82 

SO 

75 

73 

73 

15 

TN-r-6 

88 

90 

84 

80 

77 

75 

16 

Je* 

94 

94 

86 

81 

80 

79 

Average 

88.8 

€5.0 

82.4 

73.2 

73.8 

74.1 

Range 

24 

1$ 

6 

15 

9 

7 

*  Calculated  (not  measured)  and  equaled  with  "A"  on  noise  No.  V 
i  Not  in  original  (Ref.  12)  study,  equated  on  "SI-70”  with  noise  Nc.  10. 
«  These-  are  band-limiting  frequencies;  the  band  center  frequencies  arc 
0.85/1.7/3.4  and  0.43/0.85/1.7.  .  . 

A  These  are  the  center  frequencies  of  octave  bands;  the  band-hunting 
frequencies  arc  0.35/2.8. 

plotted  percentage  of  thyme  words5'  heard  correctly 
and  along  the  abscissa  is  the  ievcl  of  noise. 

The  choice  of  the  0.5/I/2  SIL  of  an  equivalent 
—  6-dB/oct  thermal  noise  is  based  on  two  facts.  The 
first  is  that  the  0.5/1/2  SIL  is  a  reasonable  compro¬ 
mise  for  showing  small  numerical  fluctuations  among 
the  physical  measurements  of  the  original  16  equally 
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Fig  1.  Speech  intelligibility  (percent  rhyme  words  correct)  as  a 
function  of  jet-aircraft  idling  noise  level.  On  the  lop  abscissa, 
noise  levels  arc  listed  as  measured  on  the  C-weighting  network 
of  a  sound-level  meter.  On  the  bottom  abscissa,  the  noise  level  is 
listed  as  the  speech  interference  level  (SIL),  based  on  the  octaves 
centered  at  500,  1000,  and  2000  cps,  of  a  minus  6-dB/oct  shaped 
thermal  noise  that  is  equivalent  in  its  ability  to  interfere  with  the 
intelligibility  of  speech  to  the  jet -noise  levels  listed  on  the  top 
clrsdssa.  Three  generic  types  of  results  are  shown;  face-to-facc, 
sound -[lowered  phone,  and  amplifieo  speech.  Within  the  face-to- 
face  results,  the  parameter  is  distance  between  talker  and  listener. 
The  limits  on  the  sound  powered  results  are  present-day  “opera¬ 
tional”  equipment  (to  the  left)  and  “developmental”  equipment 
(to  the  right).  In  the  amplified-spccch  results,  the  major  parameter 
is  presence  or  altscnce  of  a  microphone  shield.  When  a  shield  is 
used,  a  subparameter  is  whether  or  not  dipping  is  used  for  ear¬ 
phone  listening.  When  a  shield  is  not  used,  the  subparameter  is 
whether  an  average  (left!  or  excellent  (right)  earmutt  is  used 
around  the  earphone. 


**  G.  Fairbanks,  J.  Acoust.  Soc  Am.  30, 596-600  (1955). 
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speech -interfering  noises.12  The  second  is  that  the 
—  6-dB  'oct  noise  is  z  reasonable  compromise  among 
noises  to  be  representative  of  ship  noises,1*  office 
j  noises, *'•**  and  noises  used  in  laboratory  studies  of 

1  speech  intelligibility.*1 

jt  Figure  1  deals  with  three  specific  communication 

situations:  face-to-face,  sound-powered  phone,  and 
amplified  speech.  It  shows  for  each  form  of  communi¬ 
cation  the  limiting  noise  levels  for  given  degrees  of 
communication  effectiveness  (percentage  of  rhyme 
words  correct).  For  the  majority  of  the  studies  sum¬ 
marized  in  Fig.  1,  a  single  expe.  fenced  talker  and  five 
experienced  listeners  were  used*  The  following  three 
sections  deai  in  detail  with  each  communicating  situ¬ 
ation  :  face-to-tace,  sound-powered  phone,  and  amplified 
speech.  In  each  section,  repeated  reference  is  made  back 
to  Fig.  1. 

U.  FACE-rO-FACE  COMMUHiCATJON 

The  most  satisfactory,  but  least  noise-resistant,  com¬ 
munication  is  face-to-face  communication.  For  these 
face-to-face  tests,  no  constraints  were  put  on  the  vocal 
level  of  the  talker.  Nor  was  the  talker  asked  to  maintain 
any  given  level  of  word  intelligibility.  He  was  in  the 
same  room  with  his  listeners.  He  could  see  them  and  he 
could  hear  the  ambient-noise  level  around  them.  The 
listeners  faced  the  talker  but  the  rate  of  word  presen¬ 
tation  was  such  that  lip  reading  can  be  almost  com¬ 
plete!  ■  discounted.  The  voice  level  that  he  adopted  was 
left  to  his  own  knowledge  of  the  test  situation. 

The  lin.iting  factor  in  face-to-face  communication  in 
noise  is  the  distance  between  the  talker  and  the  listener, 
since  the  potential  voice  level  of  the  talker  and  accept 
able  listening  levels  are  physiologically  limited.  Observe 
in  the  four  curves  to  the  left  in  Fig.  1  that  at  any  single 
criterion  level,  say  70%  correct,  for  each  doubling  of 
ihe  distance  between  talker  and  listener,  6  d Bless  noise 
can  be  tolerated. 

To  show  how  the  face-to-face  data  in  Fig.  1  compare 
with  the  extensive  work  of  Bcranek,1’*1—  which  culmi¬ 
nated  in  the  0.6  4.8  SIL,  note  the  bars  superimposed 
at  the  85%  rhyme-word  position  between  the  0.5.  1  2 
SIL  levels  of  48  and  84  dB.  For  esse  of  viewing,  the 
bars  are  placed  under  one  another ;  however,  they  should 
be  thought  of  as  being  located  at,  and  only  at,  the  85% 
point.  The  85%  position  was  chosen  because  Ref.  24 
states  that  barely  reliable  conversation  corresponds  to 
a  PB  word  score  of  75%.  And  Montague*  shows  that  a 
PB  score  of  75%  corresponds  to  a  rhyme  score  of  85% 
(and  a  Harvard  sentence*1  score  of  93%). 

55  L.  L.  Beranck.  J.  Acoust.  Sac.  Am.  2S,  SB-S52  (1956). 

=  L.  L.  Beranek,  “Revised  Criteria  for  Noise  in  Buildings,” 
Noise  Control  3,  No.  1,  19-i'7  (1957). 

3  K.  D.  Kryter,  J.  Acoust  Soc  Am.  18,  413-417  (1946). 

*  K.  D.  Krvtcr,  J.  C.  R  Licklidcr.  J.  C-  Webster,  and  M. 
Hawley,  “Speech  Communicition."  in  Human  lixpueering  Guide 
to  Equipment  Design,  C-  T.  Morgan  ei  ai ..  Eds.  (McGraw-Hill 
Book  Co.,  Inc.,  New  York,  *9V3),  Chap.  4.  p.  179. 

“J.  P.  Egan,  “Articulati  »n  Testing  Methods,”  OSRD  Kept. 


The  SIL  bars  represent,  from  top  to  bottom,  the 
range  of  noise  levels  in  which  conversation  can  be  carried 
on  when  the  talkers  and  listeners  are  12, 6, 3,  and  1.5  ft 
apart.  The  left  end  of  each  bar  represents  “normal” 
voice  level,  the  right  “shout” ;  equally  spaced  in  between 
are  “raised”  and  “very  loud”  voice  levels.  It  is  under¬ 
stood,  of  course,  that  if  a  “normal”  level  is  under¬ 
standable  at  any  given  level  of  noise  it  will  also  be 
understood  at  lesser  levels  of  noise. 

The  SIL  bars  were  placed  between  48  and  84  dB  by 
adding  5  dB  onto  the  levels  shown  in  any  Table  showing 
0.6  4.8  SIL  versus  distance  and  voice  level  (say,  Table 
4-13  of  Ref.  24).  This  5  dB  accounts  for  the  difference 
between  the  0.6  4.8  SIL  and  0.5  1  2  SIL’s  for  noises 
similar  to  noises  3  and  4  in  Table  I.  Noises  3  and  4  are 
typical  of  the  office  noises  used  by  Beranek1' •**•**  in  de¬ 
veloping  the  SIL. 

The  fact  that  the  bars  (representing  Beranek’s  SIL 
formulation)  are  almost  exactly  bisected  by  the  proper 
distance  contours  (where  voice  level  was  at  the  option 
of  the  talker)  shows  that  the  two  sets  of  data  are  highly 
compatible. 

The  maximum  noise  level  in  which  face-to-face  com¬ 
munication  is  possible  can  probably  be  set  at  about 
0.5  1  2  SIL  of  95  dB.  Note,  for  example,  the  maximum 
rating  in  0.6  4.8  SIL  Tables*4  is  89  dB  (or  94  dB 
0.5  1  2  S!L).  Or  note  that  Pickett**  states,  “The  maxi¬ 
mum  tolerable  noise  levels  for  90%  sentence  intelligi¬ 
bility  and  1  m  between  talker  and  listener  were  esti¬ 
mated  to  be  95  dB  for  white  noise  [noise  No.  10,  Table 
13  and  105  dB  for  low-frequency  noise.”  These  over-all 
levels,  when  converted  (from  the  spectra  given  in 
Pickett’s  original  paper)  to  0.5  1  2  SIL,  become  88  and 
96  dB,  respectively.  Pickett's**  conditions  were  for  a 
1-m  distance  between  talker  and  listener  whereas  the 
0.6  ’4.8  SIL  Table  specified  0.5  ft.  But  Pickett  was  using 
a  highiy  trained  talker  and  looking  specifically  for  a 
maximum  level.  The  0.6  4.8  SIL  Table  is  for  average 
talkers  and  listeners. 

An  extrapolation  to  Fig.  2  of  Ref.  27  also  shows  an 
over-all  level  of  95-dB  thermal  noise  (0.5  1  2  SIL=87 
dB)  to  evoke  the  maximum  usable  speech  level  from 
talkers  communicating  at  1  m.  Face-to-face  communi¬ 
cations  were  not  attempted  in  levels  this  high  in  the  data 
of  Fig.  1.  An  extrapolation  of  the  “distance-halved” 
contours  at  S5%  rhyme  scores  at  95  dB  0.5/1.  2  SIL 
shows  the  permissible  distance  to  be  roughly  2.25  in. 
Ail  of  these  values  tend  to  confirm  the  fact  that  face-to- 
face  (actually  mouth  to  hearing-protected  ear)  com¬ 
munication  ceases  at  about  115  dB(C)  In  jet  noises  on 
aircraft  carriers. 

3S02  (1944);  I’B  22348.  Also  published  in  l.arvngosco]>c  5S. 
955-991  (1948). 

51  J.  M.  Picket!.  J.  Acoust.  Soc.  Am.  30.  278-  2S1  (19.381. 

SJ.  C.  Webster  and  K.  G.  Klumpp,  J.  Acoust.  Soc.  Am.  31, 
936  941  (1962). 
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III.  SOUND-POWERED-PHONE  COMMUNICATION 

The  sound-powered  phone  (SITj,  as  iis  name  implies, 
requires  no  source  of  external  jjmver.  its  sole  source  of 
audiofrequency  energy  is  sound,  preferably  voice  sounds 
but  also,  of  course,  noise.  It  achieves  an  orlhotelephonic 
gain  of  20  dB  (a  gain  of  20  dB  over  the  level  of  a  voice 
in  air  at  1  m)  by  utilizing  a  resonance  phenomenon7'  to 
take  full  advantage  of  the  existing  energy  in  a  narrow 
band  around  151X)  cps.  It  sacrifices  quality  (wide  band 
frequency  response)  for  quantity  (more  speech  power 
per  unit  bandwidth).  Typical  frequency  responses  of 
sound-powered  phones3*  show  peaks  at  frequencies 
varying  from  .SO!!  to  25(K)  cps,  and  very  little  resjtonse  at 
all  beyond  plus  or  minus  1  ocl  from  the  peak  frequency. 

Wiener3'  states  that  “The  rapid  decrease  in  response 
above  15!>0  cps  is  due  to  both  earphone  and  micro¬ 
phone.  .  .  .  The  decrease  ...  at  the  lower  frequencies 
is  due  mainly  to  the  microphone.”  In  discussing  whether 
improved  performance  could  be  obtained  by  centering 
the  resonance  at  another  (lower)  frequency,  Wiener 
(based  on  data  of  Egan  and  Wiener,  Ref.  2S,  S"c.  7.2.4, 
but  published  elsewhere  later*')  concludes,  "...  net 
gain  in  performance  will  result  from  shift  in  frequency 
.  .  .  only  if  the  orlhotelephonic  gain  or  the  bandwidth 
or  both  arc  increased.”  Since  SIT  communication  is 
inherently  limited  by  its  frequency-response  cnaractcr- 
istics,  SIT  intelligibility  scores  for  quiet  conditions  are 
typically  between  K5 %  and  60%  I’B  words  correct 
rather  that  lift  ween  65%  and  1<X!%  as  arc  scores  for 
broad-band  equipment.  Because  of  this,  Wiener*  ques¬ 
tioned  (in  1646)  "...  whether  it  might  not  be  more 
profitable  to  abandon  the  sound-powered  principle  .  .  . 
ar.d  install  an  interphone  system  using  efficient  wide¬ 
band  instruments  and  electronic  amplifiers.**  In  refuting 
his  own  doubts,  Wiener3*  listed  reasons  for  the  con¬ 
tinued  widespread  use  of  sound-powered  phones  in 
Naval  ships.  He  mentioned  “reliability”  and  the  fact 
that  the  sound-powered  phone  ”...  performs  satis¬ 
factorily  under  conditions  of  relative  quiet  .  .  .  (and  j 
.  .  .  is  the  simplest  (system  j  possible.”  He  did,  however, 
suggest,  and  made  preliminary  tests  to  show,  the  ad¬ 
vantages  of  noisc-cancclling  microphones  and  noise 
shields  for  microphones,  and  the  use  of  better  ear 
cushions  around  the  earphones.  Ali  o:  these  schemes 
have  sultscqucnily  been  tested  and  have  extended  the 
potential  use  of  sound-powered  phones  into  higher  and 
higher  noise  fields. 

Whether  the  SIT  system  should  remain  the  mainstay 
of  ESN  ships'  interior  communications  in  this  day  of 
higher  noise  levels  (and  increasing  needs  for  passing 

s  p.  M,  Wiener.  "Sjwcial  Voice  Communication  Systems,” 
in  Transmission  and  Reception  of. Sounds  under  Combat  Conditions, 
C.  E.  Waring.  E>1.  (Sum.  Tech.  Kept.  Div.  J7  National  Defense 
Research  Commission.  Washington.  I).C..  1946»,  Yol.  3,  Chap.  12. 

3  L.  L.  Btranck  *ar*d  stall  of  Electro  Acoustic  lath..  Harvard 
Univ.i,  "Audio  Characteristics  of  Communication  Equipment." 
PXR-6  (1  Feb.  1945*.  Figs.  46  50. 

*  J.  P.  Egan  and  I .  M.  V»  iencr,  J.  Acoust.  Soc.  Am.  IS,  435  441 
(1946). 


more  complex  data  and  information)  is  a  s\  stents  prob¬ 
lem  beyond  the  scope  of  this  paper.  The  system  aspects 
of  the  problem  are  covered  elsewhere.11  The  purpose  of 
this  section  of  this  pap-'-r  is  to  assess  the  range  of  noise 
levels  in  which  speech  communication  is  possible,  using 
sound-powered  phones. 

When  the  same  five  listeners  and  the  same  talker 
used  in  the  face-to  face  experiment  are  icstcd  or.  SIT 
equipment,  the  SIT  results  summarized  :n  the  center  of 
Fig.  1  obtain.  With  present-day  operational,  nonnoise- 
proofed  phones,  the  results  are  no  better  than  face-to- 
face  communication  at,  say,  2  ft  (considering  the  7(1% 
criterion).  However,  "developmental”  equipment  uti¬ 
lizing  noise  cancelling  microphones  and  noise  attcunat- 
ing  cushions  around  earphones  does  extend  usable 
communications  to  noise  levels  beyond  fare-to-lace 
capabilities. 

One  of  the  differences  between  face-to-face  com¬ 
munication  and  a  wire-connected  system  is  that  in  a 
wire-connected  system  each  communicator  can  l>c  (ar.d 
usually  is)  in  his  own  noise  environment.  Figure  1  is 
based  on  the  premise  that  both  the  talker  and  the 
listener  are  in  the  same  noise  environment.  Figure  2 
reflects  speech-intelligibility  results  when  there  are  dif¬ 
ferent  noise  environments  around  the  talker  and  the 
listener.*3  Three  possible  cases  are  shown :  X  X,  X-Q, 
and  Q-X.  In  the  X  X  case,  both  the  talker  and  listener 
arc  in  the  noise  level  shown  on  the  abscissa.  These  are 
the  data  summarized  in  Fig.  1.  In  the  X-Q  case,  the 
talker  is  in  the  noise  level  along  the  abscissa  ar.d  the 
listener  is  in  relative  quiet  (the  noise  level  of  the  leftmost 
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Fig.  2.  Speech  intelli¬ 
gibility  i percent  /rhyme 
words  correct;  vs  jet- 
noise  level  for  SIT 
equipments.  When  both 
the  talker  and  the 
listeners  arc  in  the  noise 
levels  listed  on  the  ab¬ 
scissa  (N-Nt,  B  refers 
to  developmental  equip¬ 
ment,  W  to  operational 
equipment,  and  A  to  an 
average  of  al>oul  8 
equipments  (mostly  de¬ 
velopmental.  i’or  the 
X-Q  and  Q-X  condi¬ 
tions.  only  the  average 
results  are  shown.  X-Q 
means  talker  in  noise, 
listeners  in  quiet  (noise 
level  of  leftmost  data 
point*.  Q-X  is  the  re¬ 
verse-  talker  in  “quiet.*' 
listeners  in  noise.  The 
noise  levels  are  as  in 
Fig.  1 :  over-all  *Ci  level 
— 6-db  oc;  noise  that  is 


of  jet  noise  itop;  and  0.5  1,  2  SI1.  of  a 
equivalent  to  the  jet  noise  in  its  s;>ccch  interfering  properties. 


J'  J.  C  Wei/strr  and  1  B.  Henri.  “I)o  Sound  Towered  Phones 
Have  a  1  uturc.'"  tlo  Ik-  |  ublishoi  >. 

s  The  dala  are  from  Ref.  5  and  P.  O.  Thom;«son.  "Effectiveness 
of  Three  Sound  Powered  Telephone  Sets  in  High  Level  N*«sc.” 
NHL  “<ch.  Mem.  H)T  (11  June  1963;. 
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data  point  in  each  series).  In  the  Q-N  case,  the  talker 
is  in  the  noise  level  of  the  leftmost  data  point  and  the 
listener  is  in  the  noise  level  shown  on  the  abscissa. 

There  is,  of  course,  the  fourth  case  (Q-Q)  where  both 
the  talker  and  listener  are  in  relative  quiet.  This  is  the 
limiting  case  for  each  of  the  ether  three  cases.  In  Fig.  2, 
the  spread  of  the  leftmost  points  of  each  of  the  average 
conditions  (85±2%)  is  a  measure  of  the  variability  of 
the  exj>erimental  data.  Xote  in  Fig.  2  that,  when 
averaged  over  many  different  (operational  and  de¬ 
velopmental)  sound-powered  phones,  the  highest  intelli¬ 
gibility  scores  obtain  in  the  X'-Q  case,  then  the  X-X 
case,  and  the  lowest  scores  ;  re  for  the  Q-X  case  (since 
the  talker  doesn’t  “speak  up"  when  in  the  quiet).  The 
"BEST”  results  almost  always  obtain  when  the  latest 
and  best  developmental  “noiseproofed”  equipment  is 
tested.  The  "WORST”  intelligibility  results  are  from 
the  older  operational  equipment  not  designed  for  use 
in  high  noise  levels. 

Although  all  of  the  data  are  not  duplicated  here  from 
Refs.  5,  9,  32,  and  33,  some  generalizations  from  the 
more  detailed  data  are  made:  namely,  the  best  develop¬ 
ment?.!  noisepnoofed  sound-powered  equipment  can  be 
used  (70%  rhyme  words  correct)  in  noise  levels  of 
1 14  dB  0.5  T  2  SIL  of  — 6-dB  oct  thermal  noise  (TX-6) 
or  135  dB  over-all  jet  noise  if  the  listener  is  in  a  quiet 
location  (X-Q) ;  94  dB  SIL  or  115-dB  over-all  jet  noise 
when  both  talker  and  listener  arc  in  noise  (X-X); 
and  S4  dB  SIL  or  105- dB  jet  noise  if  the  talker  is  in 
the  quiet  and  cannot  be  induced  to  literally  shout  into 
his  microphone  (Q~X). 

IV.  AMPLIFIED  VOICE  COMMUNICATION, 
EARPHONE  LISTENING 

What  about  systems  utilizing  electronic  amplification 
and,  in  particular,  radio  systems  where  distance  be¬ 
tween  talker  and  listener  is  of  no  import  ?  Several  flight 
deck  radio  systems  designed  for  use  in  high-level  noise 
have  been  evaluated,  using  the  same  talker  and  listeners 
as  used  in  the  face-to-face  and  sound-powered-equip- 
ment  tests.**-7  Reference  back  to  the  right-hand  side 
of  Fig.  1  shoe  ,  the  limiting  results  of  such  talking- 
listening  tests.  Although  the  details  are  not  shown  in 
Fig.  1,  in  general  it  can  be  said  that,  when  a  microphone 
is  in  relative  quiet  (or  is  well-shielded;  or  when  the 
listener  is  in  quiet,  i.e.,  in  Q-X  or  X-Q  conditions, 
satisfactory  military -word  intelligibility  (greater  than 
70%  rhyme  word  score)  prevails  up  to  levels  greater 
than  125-dB(C)  jet  noise.  Only  when  both  the  taiker 
and  the  listener  are  in  noise  {X-X.  the  data  shown  in 
Fig.  1)  is  the  intelligibility  limited  by  !25-dB(C)  jet 
noise,  and  in  this  case  optimum  intelligibility  results 


°  for  noise-attenuation  results  on  man\  of  the  latest  ear  iro- 
tective  muffs,  see  Ref.  1 1.  When  used  in  sound-powered  equipment, 
see  Ref.  52  and  P.  O.  Thompson,  “A~mstic  Attenuation  of  Certain 
Hearing  Selectors  and  Sound- P«wr;cd  Telephone  Sets,”  X EL 
Tech.  Men:.  710  (17  July  1564;. 


from  listening  via  earphones  in  muffs  as  good  as  those 
with  liquid-filled  cushions.1 

What  techniques,  precautions,  and  circuitry  can  be 
utilized  to  optimize  talking  and  listening  in  noise?  The 
major  objective  is  to  maintain  a  sizeable  differential 
between  the  speech  signal  and  the  unwanted  noise  at 
the  ear  of  the  listener. 

To  maintain  a  satisfactory  speech-to-noise  differential 
at  the  input  of  a  communication  system  operated  in 
noise,  it  is  necessary  that  the  talker  increase  his  vocal 
output  as  the  ambient  noise  around  him  increases.  Tests 
in  noise,  with  talkers  using  noise-shielded  microphones 
and  wearing  muffs  over  earphones,  indicated  that  the 
level  at  which  the  talker  hears  his  own  voice  has  con¬ 
siderable  effect  on  his  vocal  output.5  Results  lor  a  single 
talker  showed  that  maximum  vocal  output  was  ob¬ 
tained  when  no  sidctor.c  was  provided.  As  sidetone 
level  was  increased,  the  talker  reduced  his  vocal  effort. 
Tests  with  several  talkers  showed  that  the  intelligibility 
of  speech  from  talkers  in  noise  did  not  change  signifi¬ 
cantly  as  sidetone  level  was  varied,  except  when  sidetone 
was  10  dB  over  the  preferred  level.  With  more  than  the 
preferred  amount  of  sidetone,  talkers  reduced  their 
vocal  effort,  the  S-X  dropped,  and  intelligibility  was 
adverseK-  affected.  Apparently,  for  less  than  preferred 
amounts  of  sidetone,  gains  in  speech-to-noise  differ¬ 
entials  counterbalance  any  distortion  incurred  as  the 
talker  approaches  maximum  vocal  effort,  even  though 
Pickett**  shows  that,  other  things  being  equal,  intelli¬ 
gibility  decreases  as  maximum  vocal  effort  is  reached. 

In  radio  systems,  if  the  speech  is  well  above  the  noise 
in  the  received  and  rectified  radio-frequency  signal,  the 
speech  should  be  intelligible  if  the  ambient  noise  leaking 
through  the  earmuffs  does  not  mask  out  the  speech 
from  the  earphones.  It  would  appear  that,  if  too 
much  of  this  noise  leaks  to  the  listener’s  ears,  a  satis¬ 
factory  S  X  could  be  preserved  by  mereK  increasing 
the  amplitude  of  the  speech  signal  from  the  earphones. 
Several  factors  limit  the  practicality  of  this  solution. 
The  principal  facior  is  comfort :  people  do  not  like  ex¬ 
tremely  loud  sounds,  whether  they  be  speech  or  noise. 
A  second  factor  is  safety:  loud  sounds  can  and  do  cause 
hearing  losses  it  listened  to  over  prolonged  periods. 

Concerning  the  comfort  aspect  of  listening  to  loud 
'pecch  sounds,  two  processing  techniques  can  be  used : 

(1)  A  noisc-a*tua»ed  automatic  volume  control 
(AVC)  can  be  used  so  th2l  the  voice  level  in  the  ear¬ 
phones  will  always  be  at  a  comfortable  and  intelligible 
level  relative  to  the  ambient -noise  background.  Pollack*5 
has  shown  that  such  an  AVC  dGcs  not  interfere  with, 
and  may  substantially  improve,  speech  intelligibility. 
A  small  microphone  in  the  noise  field  to  which  the 
listener  is  exposed  can  provide  the  noise  signal  to  acti¬ 
vate  the  AVC  circuit.  Use  of  the  AVC  circuit  has  the 
additional  advantage  of  simplifying  the  system  from 

**  j.  M.  Pickett,  J.  Acoost.  5oc.  Am.  2S,  902-905  (1956). 

“  i.  PoUadt,  J.  Acoust.  Soc  Am.  29,  1524-1327  (1957;. 
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the  user's  standpoint  by  eliminating  a  manual  volume 
control-  The  design  of  an  A  VC  circuit  can  be  based  on 
the  data  shown  in  Fig.  3.  For  the  curves  shown  in  Fig.  3, 
a  group  of  listeners  wearing  ear. nuns  ever  earphones 
adjusted  the  speech  level  from  their  <arphones  to  three 
criteria:  maximum  level  that  they  would  tolerate,  level 
preferred,  and  minimum  level  for  S0%-90%  rhyme- 
word  intelligibility. 

(2i  Peak  dipping  can  be  used  at  the  higher  listening 
levels.  For  example,  up  to  12  dB  of  clipping  has  a 
negligible  effect  on  speech  intelligibility,  does  not  de¬ 
stroy  voice  quality,  and  protects  the  listener’s  ears 
from  uncomfortable  and  potentially  deafening  speech 
peaks.** 

Both  of  these  techniques  have  been  used  in  the  flight- 
c“dc  radio  developed  at  the  U.  S.  Navy  electronics 
Laboratory.1  •*' 

Concerning  the  second,  or  safety,  factor  relative  to 
amplifying  speech  indefinitely  to  make  it  intelligible  in 
high-level  noise:  How  loud  can  a  sound  be  before  hearing 
losses  are  a  potential  problem?  Flanagan  and  Guttman1’ 
have  proposed  that  (for  exposures  over  the  years 
during  work-week  hours),  if  the  C  and  A  levels  are 
within  1  dB  of  each  other,  a  C  (or  A)  level  greater  than 
SO  dB  should  be  regarded  as  unsafe.  If  the  C-A  level 
difference  is  between  1  and  6  dB,  a  levei  of  85  dB(C) 
should  be  considered  unsafe.  If  C-A  is  between  6  and 
12, 90  dB(C)  is  the  limit,  and  if  C-A  is  greater  than  12, 
95  dB(C)  is  the  safe-unsafe  boundary.  Young13  has 
suggested  that  this  is  almost  tantamount  to  saying  That 
if  the  A  levei  is  less  than  80  dB  the  noise  is  sate. 

If  noise  levels  greater  than  80  dB(A)  for  a  working 
day  are  potentially  unsafe,*3  then  speech  levels  (at  the 
ear)  that  continually  exceed  ambient  noises  at  these 
levels  would  also  be  unsafe. 

In  general,  good  speech  intelligibility  in  high-level 
noise  has  been  obtained7-*7  by  using  (1)  a  noise-cancel¬ 
ling  microphone  (in  a  noise  shield*1-*5  for  extreme  noise) ; 
(2)  earphones  in  noise-attenuating  earmuffs,11  (3)  a 
wide  speech  bandwidth  (three  or  more  octaves  wide), 
centered  somewhere  between  1000  and  1SQ0  cps  (Ref. 
43) ;  (4)  minimum  or  no  sidetone1 ;  (5)  A  VC  circuit  to 


*•  J.  C.  R.  Lickiider  and  G.  A.  Milier,  "The  Perception  of 
Spiech,”  in  Hondboeh  of  Eipzrtmentol  Ftycholocy,  S.  5.  Stevens. 
Ed.  (John  Wilcv  &  Sons,  Inc..  New  York,  J95lj,  Chap.  26,  pp. 
10*1-1062. 

1  D.  C-  Gibson,  “CSNEL  Flight  Deck  Corntnenicaiicns System. 
Part  I.  Noise  Attenuating  Radio  Communications  Helmet  and 
Fired  Station  Equipment,”  NEL  Kept.  922  (14  Mar.  19W),  AD 
254  531. 

*  J.  L.  Flanagan  and  N.  Guttman,  J.  Acoust  See.  Am.  3*, 
1654-1653  (1964). 

a  R.  W.  Young,  persona1,  communication  and  footnote  on  Table 
III,  Ref.  33. 

*•  The  Revision  of  Document  43  (Secretariat  -194}  314  of  Techni¬ 
cal  Committee  43,  Working  Group  8  of  the  international  Organ* 
ration  for  Standardization,  dated  June  1964,  puts  this  safe-unsafe 
loundarv  level  at  90  dB(A}  for  5-h  davs. 

«  M.  S.  Hawley.  J.  Acoust.  Soc  Am.  23,  i256-1260  (1956). 
c  M .  S.  Hawley,  J.  Acoust.  Soc.  Am.  36,  ! 83- 190  (195S1. 


Fic.  3.  Rela¬ 
tive  earphone 
listening  level  vs 
noise  level  for  ac¬ 
ceptable  listening 
levels. 


conform  in  general  to  preferred  listening  level  as  shown 
in  Fig.  3:  (6)  peak  clipping  of  12  dB  at  maximum 
power3'3*-77;  and  (7)  flat  frequency  response  and  mini¬ 
mum  distortion  in  the  audio  circuitry  77 

It  is  often  thought  that  speech  picked  up  at  some 
anatomical  location  other  than  in  front  of  the  lips  would 
aid  speech  intelligibility  in  noise.  Studies4*-*7  concerned 
with  vibratory  or  bone  pickup  show  in  general  that  this 
is  not  so  and  that,  as  the  distance  from  the  larynx 
increases,  the  amplitude  of  pickcd-up  sound  decreases. 
Likewise,  ear-insert  microphones  work  well  in  the  quiet** 
but  are  not  in  competition  with  microphones  located 
in  front  of  the  lips  in  high  levels  of  noise.** 

An  evaluation  of  representative  vibratory  and  car- 
pickup  microphones  in  1957  stated*7  that,  “Transducer 
outputs  are  intelligible  ...  to  the  following  sound- 
pressure  levels  [of  jet  noise]:  137  dB  for  the  M-55 
rnoise-canceliing  dynamic  microphone  in  noise  shield]; 
120  dB  for  the  D-98  Airphone  Tan  ear-insert  phone]; 
110  dB  lor  the  PDR-S  fworn  over  the  ear];  120  dB  for 
the  YM-1  [[vibration  pickup]  on  the  forehead;  and  130 
dB  for  the  YM-i  on  the  mandible.”  Subtract  21  dB 
from  all  these  values  to  equate  to  a  0J/1/2  SIL  of 
equivalent  flat  (or  minus  6  dB/oct)  thermal  noise. 

A  comprehensive  U.  5.  Air  Force-sponsored  study  by 
the  Western  Electro- Acoustic  Laboratory  on  the  per- 


°J.  C  Webster.  "Generalized  Speech  Interference  Noise 
Contours,”  j.  Speech  Hearing  Res.  7, 133-141  (1964;. 

**  J.  Mulicndore,  “An  Experimental  Study  of  the  Vihrat:on  of 
the  Bones  of  the  Head  and  Chest  during  S.tslained  Vowel  Sounds," 
Speech  Monographs  16,  163-176  (1949). 

“  G.  von  Bektsy  and  W.  A.  RobcnbKth,  ‘The  Mechanical  Prop¬ 
erties  of  the  Ear,”  in  llotulbook  of  rjpcrzKtsSal  Psychology,  S.  S. 
Stevens,  Ed.  (John  Wiicy  &  Sons,  Inc.,  New  York,  1951),  Chap. 
27,  p.  1J11. 

**  H.  Moser  and  H.  J.  Oyer,  j.  Acoust.  Soc  Am.  36,  275-277 
(1958). 

a  J-  C.  Webster.  P.  O.  Thompson.  J.  Snidecor,  and  D.  D.  Wash¬ 
burn,  "Speech  Pickup  from  Various  .Anatomical  Locations,” 
Proc.  Decade  Basic  Appl.  Sd.  in  the  Navy,  On.  Naval  Res., 
Washington.  D.  C.  tl95?). 

”  H.  J.  Oyer,  J.  Acoust.  Soc.  Am.  27,  1207-1212  (1955). 

**  R.  D.  BUck,  J  Acoust.  Soc.  Am.  24,  260-264  (1957). 
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I  k:.  4.  Speech  intelligibility  (percent  I’B  ts^rd-,  correct,  tor 
speech  at  various  levels  vs  noise  level  of  a  -MB/oa  noise  (Fig.  2. 
Ref.  23)  and  (at  extreme  right)  a  hovering  HU  I*  neHcoptcr  noise 
(relative  octave  levels  of  —23,  —  S,  —9,  —7.0,  —3,  — S,  and  —16 
dB  for  octaves  centered  at  6',  125,  etc.}.  The  solid  liner  are  for 
open  ears,  the  dotted  litres  for  protected  cars  (plugs  cr  muffsi. 
The  majority  of  the  data  in  this  Figure  are  reploUed  from  Fig-  3, 
Ref.  2?  (with  abscissa  and  parameter  values  interchanged*. 
The  data  at  the  extreme  right  are  from  Ref.  6.  To  equate  the  t»«, 
sets  of  data,  use  is  made  (cn  the  bottom  abscissa)  of  the  0.5/1  fl 
SIL  of  — fc-dH/oct  thermal  noise.  Xo  equating  is  done  o  ■  the  ordi¬ 
nate  it  the  comparative  irUcfligibilm  cf  I’B  words14  lefti  and 
multiple -choice  words—  (right). 


lormancc  of  electroacoustic  transducers  in  noise  has 
been  reported  orally50'5*  but  as  yet  appears  only  in 
company  reports,51  and  the  data  arc  ’oo  extensive  to 
summarize  here.  It  was  concluded  that  a  microphone 
embedded  in  a  dental  plate  or  otherwise  fastened  to 
the  lower  teeth  inside  the  mouth  showed  promise  for 
communicating  in  high  noise  levels.55 

For  communicating  in  high  levels  of  noise,  the  old 
reliable  carbon  button  has  had  its  day.  At  ambient 
levels  above  80  dB  over  all  ;et  noise  -'50-dB  equivalent 
0-5  1  2  SIL  of  flat  TX  and  TX-6),  dynamic  (and  con¬ 
denser)  microphones  .rill  give  better  intelligibility 
scores.®-*-* 

V.  AMPLIFIED  VOICE  COMMUNICATION, 
LOUDSPEAKER  LISTENING 

What  are  the  limbing  levels  of  noise  m  which  ampli- 
tied  speech  can  be  heard  over  a  loudspeaker?  Figure  4 
is  a  replot  of  .some  data  of  Kryter  s  {Fig.  3  in  Ref.  23). 
In  this  rcplot,  the  abscissa  and  pavamete*  dimensions 
are  interchanged  from  Kryter  s  original  Fig.  3.  In  this 

“■  \V.  B.  Snow,  J.  Acoust.  Soc  Am.  33.  1661  (A>  « 1961  *. 

H  I*  S.  Vend. laser;,  j.  Acoust.  See.  Am,  33,  1661  »A)  *1961  > 

#  j.  I*.  ChrsioS,  J.  Acoust.  See  Am.  33,  1662(A)  (1961  >. 

“"Slu.-Jx  anil  Investigation  of  Specialized  Electro  Acoustic 
Transducers*  for  Voice  Communication  in  Aircraft."  Western 
Ebctro-Acousi  Lab.,  Inc.,  Final  Rept.  with  Svppl.  A;ip.  1-6, 
Contract  AF33(616)-3710,  Task  Xo.  43060  (Feb.  1959). 


form,  it  is  easy  to  see  how  speech  intelligibility  for 
loudspeaker-reproduced  amplified  speech  is  limited  by 
ambient  noise  and  how  this  varies  with  (1)  amplified 
speech  level  and  (2)  with  the  wearing  of  earplugs 
(dotted  lines),  versus  open  ears  (solid  lines).  The  re¬ 
maining  alteration  to  the  original  Kryter  data  is  that 
noise  level  is  now  plotted  as  the  0.5  1  2  SIL  (of  the 
— 6-dB  oct  thermal  noise)  instead  of  as  the  over-all 
level. 

When  these  two  transformations  are  made,  many 
comparisons  are  possible.  For  example,  note  the  shaded 
areas  around  the  85-dB  speech-level  curves.  This  area 
delimits  Kryter  *s  data  on  his  Fig.  7  where  talkers  and 
listeners  were  in  a  face-to-face  communication  situation 
at  a  distance  of  7  ft.  The  talkers  (just  as  the  talker  used 
to  generate  the  data  in  Fig.  1 )  ‘‘spoke  at  whatever  voice 
level  they  thought  necessary  to  make  themselves 
understood”  (Ref.  23,  p.  416).  The  upper  edge  of  the 
shaded  area  is  when  neither  talkers  nor  listeners  wore 
earplugs  and  the  lower  boundary  is  when  both  talkers 
and  listeners  did  wear  earplugs.  The  original  data  shov.- 
points  in  between,  -here- one  or  the  other  wore  plugs. 

The  6  ft  curve  from  Fig.  ’  (lowered  11)  percentage 
points  to  account  lor  the  difference  between  I’B  words 
correct  and  rhyme  words  correct,*  i.e.,  59%  PB  score 
=  60%  rhyme  store)  would  tit  in  this  same  shaded  area. 
Bcraneks17  8-ft.  86-dB  speech  level,  treated-aircraf-- 
cabin  noise  curve  would  also  fall  in  the  shaded  area 
(between  the  limits  of  40%  and  80%). 

It  should  be  noted  in  passing  that  all  data  in  Fig.  4 
(and  the  nonearpisone  data  in  Fig.  1)  were  taken  in 
semireverberant  (certainly  nonanechoic)  rooms.  Kry  ter 
shows  (Ref.  23,  Fig.  5)  that  for  open  cars  the  intelligi¬ 
bility  in  an  anechoic  room  is  10%  greater  than  in  his 
reverberant  room.  For  plugged  ears,  the  10%  holds  only 
at  high  levels  of  intelligibility.  And  Thompson,  Webster, 
and  Gales51  have  shown  that  any  amount  of  re-verber- 
atLn  added  to  “dead”  speech  causes  a  decrement  in 
intelligibility. 

Also  shown  in  Fig.  4  is  a  summary  of  the  data  relating 
SI  1 voice  level,  and  distance  between  talker  and  listener 
(ccrrcctcd  from  0.6  4.8  S!l  to  0.5  1  2  SIL).  The  same 
rationale  and  explanations  as  used  when  the  SIL  data 
were  superimposed  in  Fig.  1  also  apply  here. 

At  the  top  of  the  Figure  are  cutoff  points  delimiting 
the  subjective  opinions  of  people  relating  to  the  relative 
noisiness  of  "stenograph;*,  and  large  engineering  drafting 
rooms. ”=  [Fxcctit i ve  ofiicc  personnel  tolerate  10  dB  less 
noise:  «h3t  is,  >hcy  think  that  offices  art  “noisy"  at 
an  05  1  2  SIL  of  91  dB  (not  7*)  dB),  and  "very  noisy" 
at  an  0.5  1  2  SIL  of  70  dB.) 

At  the  bottom  of  Fig.  4  arc  the  opinions  of  the  people 
surveyed  by  Beranek" "  relative  to  measured  noise 
level  {here  re-plotted  as  0.5  1  2  SIL)  and  their  ability 
to  use  the  telephone. 

“  P-  O-  Thompson,  J.  C.  V.'rijster,  and  R.  S.  Gale*,  J.  A  roust. 
Soc  Am.  33,  604-605  (1961). 
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To  augment  the  data  of  Kryter3  and  Beraneh,”  - 
some  data  of  Wdister,  Thompson,  and  Ik-hither*  are 
added  !o  the  right  in  Mg.  4.  The  noise  was  ‘  hovering 
helicopter,”  the  words  were  "multiple  choice,’""'  and 
muffs  were  used  instead  of  plugs,  but  the  data  appear  l«» 
be  consistent  with  Krxter’s3  and  do  extend  the  repro 
duced  level  of  speech  and  noise  slighiiy  bee  ond  the 
limits  tested  by  Krytcr. 

Some  general  conclusions  to  be  drawn  from  Fig.  4 
and  the  data  on  which  it  is  based  are.  At  noise  and  or 
speech  levels  above  some  point,  intelligibility  is  better 
who;  listeners  and  or  talkers  arc  wearing  earplugs3  or 
earmuffs.*  This  limited  noise  level  is  roughly  at  a  03  !  2 
SIL  of  70  clB  and  the  limiting  speech  level  is  95  dB.  Since 
people  dislike  wearing  hearing  protectors  in  marginal 
levels  of  noise  (and  don’t  believe  that  thee  can  hear 
better),  2  Oi  l  2  SIL  of  Sh  dB  should  proltabH  be 
chosen  as  a  point  where  some  sort  of  pressure  should  l>e 
brought  to  bear  to  insist  that  ear  protection  be  wont.  A 
9.5  '1  '2  SIL  of  SO  dB  corresponds  roughly  to  an  A  level 
of  90  dB  (sec  Table  I  or  Ref.  12).  where  hearing  pro¬ 
tection  should  be  went  as  a  protection  against  noise- 
induced  permanent  hearing  loss  with  S-h,  5-day,  5H- 
week,  muhipic-ycar  exposures.14 

Another  conclusion  from  Fig.  4  and  its  sources*3  3 
is  that  at  any  constant  speech -to  noise  differential  the 
intelligibility  is  remarkably  constant,  rising  some  at  low 
noise  levels  and  falling  some  at  high  noise  lex-els  (even 
when  wearing  ear  protectors).  This  slight  falloft  of 
intelligibility  with  lex-el  has  been  found  by  others**  and 
suggests  that  the  equivalent  loan  ear -o'-erioad  .ondition 
exists,  just  as  Pickett1*  has  shown  a  similar  voice  over- 
load  (or  at  least  a  decrement  of  intelligibility  with 
greatly  increased  vocal  effort). 

It  is  interesting  to  note  that  “noisx  ”  and  xtn  noisy  " 
spaces  are  associated  with  SIL’s  such  that  “shouting” 

u  J.  tV.  Black.  "MtihijAr  Clwecc  laldfiplaoi}  Tests.’’  J 
Sf-ecch  Hcannsr  IHsonkrs  22.  215  256  i  !957  ». 
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or  “very  !owi“  v«kc  kxds  ior  95  *!B  spce«  h  levels)  are 
required  i or  conversations  at  13  or  5  ft,  ami  thi<  is  ihe 
region  where  telephone  conversations  are  judge*!  to  tie 
\lit7i.  -alt”  or  “unsalisfaclon."  All  of  those  adverse 
nois<  oitxiitiuisuuui  at  the  region  where  ear  protection 
will  aid  intelligibility  and  at  the  Iknmdarx  where  ear 
protection  should  l>c  Used  !«•  protect  again:.!  hearing 
losses.  It  is  quite  apparent  that  0.5  1  2  SIL’s  abox-c 
70  dB  shouk!  be  avekkd  in  spates  where  people  must 
converse  or  communicate  via  some  commimkalion 
devke.  Spaces  w  here  t  on  versa!  inns  cannot  be  carried 
on  in  comfort  at  5  it  ate  too  noisx  for  any  type  of  task 
requiring  fate  to  face  communications.  XoiseprooSng 
of  humans  and  equipment  should  start  at  this  noise 
level.  At  0.5  1  2  SIL’s  greater  than  9fi  dB  the  wearing 
of  hearing  protection  should  be  mandatory  and  every 
noiseprooting  technique  (except  a  noise  shk-kl  for  the 
microphone!  should  be  employed.  At  Oi  i  2  SIL’s 
above  10?)  dB,  every  noiseprooting  technique  should  be 
employed. 

Beranck’s3  recommendation  that  ’.-*.6  45  SIL’s  above 
55  dB  (03  "1  2  SIL’s  of  ft)  dB)  should  avoided  for 
office  spaces  appears  to  be  a  good  choke  Levels  10  dB 
greater  than  this  should  be  avoided  even  or,  ships  where 
ventilation  and  other  noises  are  eve-  present  and  their 
q« acting  expensive. 
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Effects  of  Ambient  Noise  and  Nearby  raikers  on  a  f  ace-to-f  ace 

Communication  Task 

John  C.  Wku.ntkr  and  Roy  G.  Ki.uitri* 
l'.  S.  Navy  Electronic*  laboratory.  San  Diego  52,  California 
(Received  February  26,  1962) 

From  1  to  5  talk.r-listener  pairs,  talkers  seated  shoulder-to-shoultler  on  one  side  o  1  a  tabic  with  listeners 
on  the  other,  communicated  word  lists  in  conditions  of  <,uiet  and  ambient  thermal  noise  levels  of  65,  75, 
and  85  dB.  Each  talker  read  one  nord  at  a  time  to  his  listener-partner,  who  repeated  back  each  word  for 
verification  by  the  talker.  Talker -listener  pairs  were  instructed  to  maintain  an  accuracy  of  VO'Ji  or  better. 

For  the  lower  ambient  levels  the  sjreech  level  of  a  central  pair  increased  aUiut  5  <IH  for  an  additional  10  dB 
of  wise  or  for  each  doubling  of  the  number  of  pairs  around  them.  The  rate  of  utterance  decreased  with 
noise  but  showeu  no  clear-cut  pattern  of  change  as  the  number  of  additional  talkers  was  varied.  Accuracy 
of  communication  was.  on  the  average,  ()V,i  and  was  never  below  84ri .  Communication  errors  defy  simple  de¬ 
scription  but  in  general  (!)  for  a  constant  noise  Icy  cl,  increasing  the  number  of  talker,  results  in  increasing 
errors;  and  (2)  for  3  or  fewer  talker-listener  pairs,  |>ereont  error  docs  not  increase  until  the  ambient-noise 
level  reaches  85  dB. 


■INTRODUCTION 

IT  is  often  necessary  to  assess  the  suitability  of  a 
room  (existing  or  proposed)  for  speech  communica¬ 
tion,  from  a  knowledge  of  measured  or  expected 
ambient-noise  levels  in  the  room.  The  prediction  is  as 
often  as  not  based  soiely  on  physically  existing  noises 
(fans,  vehicular  noise,  etc  )  and  may  not  adequately 
account  for  the  cumulative  noise  effects  due  to  the 
people  within  the  room  performing  voice  communica¬ 
tion  tasks.  The  puqiose  of  this  study  is  to  determine 
the  interaction  between  physically  introduced  ambient 
noise  and  speech  levels  of  nearby  talkers  on  the  per¬ 
formance  of  talker-listener  pairs  performing  a  com¬ 
municating  task  (exchanging  Harvard  PB  words1). 
The  variables  measured  are;  speech  level  required  to 
communicate  in  increasing  levels  of  physical  and  or 
talker-generated  noise,  rate  of  utterance,  and  the  num¬ 
ber  of  errors. 

APPARATUS 

All  testing  was  done  in  a  28X16X10  ft  recording 
studio.  “Quiet”  levels  in  the  studio  were  55  dB  or  less 
on  the  C  scale  (35  dB  or  less  on  the  A  scale)  of  a  sound- 
level  meter.  Reverberation  time  of  the  unoccupied  studio 
was  approximately  0.25  sec. 

The  talker  and  the  listener  of  a  ventral  pair  were 
seated  across  from  one  another  at  the  center  of  a  table- 
measuring  60X34  in.  When  a  second  and,  or  third  pair 
was  added,  they  were  placed  beside  the  center  pair  so 
that  all  talkers  were  on  one  side  of  the  table  and  all 
listeners  on  the  other.  Talkers  were  seated  shouldcr- 
to-shoulder  and  listeners  were  shouider-to-shoulder. 
Mouth-to-ear  distance  between  any  talker-listener  pair 
was  36  in.  Center-of-head-to-center-of-head  distance 
between  adjacent  talkers  (or  listeners)  was  approxi¬ 
mately  17  in.  Paris  4  and  5,  when  used  to  generate 
additional  babble,  sat  at  wo  small  additional  tables 
placed  one  at  each  end  of  the  larger  central  table. 

1  J.  P.  Egan,  Laryngoscope  58,  955-991  (1948). 


Ambient  noise  was  obtained  from  three  speaker 
arrays  driven  by  amplifiers  and  a  thermal  noise  gen¬ 
erator.  The  noise  field  about  the  table  was  uniform  to 
within  ±1  dB  as  measured  on  the  C  scale  of  a  sound- 
level  meter.  The  frequency  response  f  the  ambient- 
noise  system  was  ±5  dB  from  300  to  6000  cps  as 
mcasuied  with  warble  tones  and  a  microphone  located 
above  the  center  of  the  tabic. 

Six  matched  microphones  were  used  to  record  the 
outputs  of  the  six  subjects  (3  talkers  and  3  listeners) 
from  whom  data  were  taken.  Microphones  were  fastened 
to  the  table  in  front  of  these  six  subjects.  Each  person 
|K>sitioned  himself  so  as  to  have  his  lips  almost  touching 
the  edge  of  the  microphone.  Li))  distance  to  center  of 
mic'ophont  was  approximately  1  in.  Microphones  were 
oriented  so  as  to  minimize  breath  blast.  ( lutputs  of  the 
six  microphones  were  recorded  on  tape  and  were  later 
analyzed  for  error,  rate  of  utterance,  and  speech  level. 

Speech  level  was  determined  by  playing  the  tapes 
back  at  half  the  recorded  speed  and  tracing  the  speech 
signals  on  a  Bruel  and  Kjaer,  type  2304  power  level 
recorder  using  a  writing  rate  of  1GO0  dB  'sec.  Sound- 
pressure  levels  were  assigned  to  these  speech  tracings 
by  determining  the  difference  between  the  speech  peaks 
and  the  background-noisc  level.  The  background-noise 
level  had  previously  been  measured  with  a  sour.d-levei 
meter  set  to  C,  “fast.” 

The  level  of  each  word  was  noted  to  the  nearest 
decibel  ard  the  median  level  for  a  list  was  determined. 
Medians  were  used  instead  of  means  because  of  the 
reduced  analysis  time  required.  Later  analysis  showed 
that  merii  ans  differed  from  mean  values,  in  24  lists 
sampled,  by  less  than  ±1  dB.  In  general  the  quartile 
values  differed  from  the  median  by  about  1.5  dB  and 
never  differed  by  over  5  dB. 

PROCEDURE 

From  2  to  10  people  were  used  as  subjects,  always 
acting  in  pairs.  They  were  members  of  the  U.  S.  Navy 
Electronics  Laboratory,  had  normal  or  near  norma! 
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hearing,  and  were  experienced  in  noise  communication 
tests.  One  member  of  the  pair,  the  talker,  read  words, 
one  at  n  time,  from  a  list  of  PB  words.  His  partner,  the 
listener,  repeated  back  each  word  as  he  heard  it.  If  the 
talker  was  satisfied  that  the  listener  had  repeated  the 
word  correctly,  he  proceeded  on  to  the  next  word.  If 
not  satisfied  as  to  the  correctness  of  the  response,  the 
talker,  acting  on  previous  instructions,  repeated  the 
word  and  again  ascertained  the  listener's  response.  Each 
pair  was  instructed  to  maintain  a  high  degree  of  ac¬ 
curacy  (90%  to  100%)  but  not  to  repeat  any  given  word 
more  than  three  times.  Maintaining  the  required  ac¬ 
curacy  was  the  responsibility  of  the  talker,  who  had 
IS  words  before  each  test  list  to  adjust  his  vocal  output 
until  he  was  satisfied  with  the  level  of  accuracy  of  his 
listener’s  responses.  He  was  cautioned  by  the  experi¬ 
menter,  who  monitored  each  list,  that  any  list  with 
excessive  errors,  would  be  rerun.  No  lists  were  actjally 
rerun  (although  scores  between  84  and  90%  did  occur 
on  2$%  of  the  lists).  They  were  also  instructed  to 
deliver  the  words  as  rapidly  as  possible  while  maintain¬ 
ing  required  accuracy.  Th's  was  done  to  minimize  the 
possibility  of  getting  in  synchrony  and  delivering  words 
in  the  interval  between  words  of  an  adjacent  pair. 
Subjects  were  instructed  to  ignore,  as  best  they  could, 
the  ambient  noise  and  the  speech  from  adjacent  pairs. 
No  instructions  were  given  as  to  exactly  what  vocal 
output  would  be  required. 

The  results  of  3  pairs  were  scored  although  5  pairs 
were  used  in  some  parts  of  the  experiment.  Pairs  4  and 
5  performed  the  same  tasks  as  the  three  centrally 
located  pairs  but  their  results  were  not  scored.  They 
served  simply  as  “noise”  sources. 

There  were  four  groupings  of  talker-listener  pairs: 
each  pair  alone,  two  pairs  together,  all  three  data- 
yielding  pairs,  and  the  three  data-yielding  pairs  plus 
two  more  babble-contributing  pairs.  There  were  10 
experimental  sessions;  each  pair  alone  (3)  replicated 
once  (total  of  6),  pair  1  vs  pair  2,  pair  1  vs  pair  3, 
pairs  1,  2,  and  3  together,  and  pairs  1,  2,  3  plus  pairs 
4  and  5.  Within  each  of  the  10  sessions,  tests  were 
carried  out  in  the  following  order  of  noises:  Quiet,  65, 
75,  85,  85,  75,  65,  Quiet.1  Each  test  consisted  of  reading 
75  PB  monosyllabic  words  (but  scoring  only  the  middle 
50  words  which  comprised  a  complete  PB  list).  The 
words  preceding  the  actual  test  words  allowed  a 
“settling  in”  period  for  adopting  a  mutually  acceptable 
level  and  rate.  The  words  following  ensured  a  constant 
babble  level  for  the  pairs  that  were  still  communicating 
after  the  faster  pairs  had  finished. 

The  experiment  was  designed  so  that  (1)  no  adjacent 
pair  read  the  same  list  at  any  time  during  a  session, 
(2)  the  extra  words  at  the  beginning  and  end  of  a  list 
were  taken  from  lists  not  used  in  that  particular  test 
session,  and  (3)  during  any  given  test  session  each 

1  In  the  condition  where  5  pairs  were  communicating,  only  4 
noise  conditions  were  used  (the  ascending  sequence  of  Quiet,  65, 
75,  and  85  dB). 
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Fic.  1.  Types  of  errors  made  by  talkers  T  and  listeners  L.  The 
eight  word  sequences  shown  at  the  left  encompassed  every  type 
of  error  found  in  these  tests.  The  type  of  error  and  the  way  these 
errors  were  summed  are  shown  at  the  right. 

pair  read  a  different  list  for  each  condition.  Preliminary 
runs  indicated  that  seeing  the  talker’s  lips  made  no 
difference  in  any  criterion  measure  of  accuracy,  rate, 
or  speech  level,  and  so  no  screen  was  used  between 
talkers  and  listeners. 

TREATMENT  OF  DATA 

The  raw  data  consisted  of  magnetic  tape  recordings 
of  three  talkers  and  three  listeners.  These  lists  of  50 
words  were  analyzed  for  (1)  speech  level,  (2)  rate,  based 
on  the  time  taken  for  the  talker  to  say  and  the  listener 
to  repeat  10  consecutive  words  in  a  stable  (error-free, 
repeat-free)  portion  of  their  performance,  and  (3) 
number  of  errors. 

To  understand  the  variety  of  possible  errors,  refer 
to  Fig.  1.  For  any  one  word  the  sequence  of  events 
which  might  transpire  between  talker  T  and  listener  L 
is  shown  in  Fig.  1.  The  symbol  A  represents  the  correct 
word  while  B  and  C  represent  incorrect  responses  to 
A.  Actually  the  talker  by  definition  never  said  an 
incorrect  word.  If  he  misread  a  word  on  his  list,  his 
misreading  became  the  new  correct  word.  On  the 
average,  94%  of  the  responses  were  correct,  that  is, 
sequence  1.  The  most  common  incorrect  sequence  was 
sequence  2,  which  sometimes  got  lengthened  into  se¬ 
quence  3.  Sequences  2  and  3  involve  listener  errors  LE 
but  no  talker  errors.  Sequence  4,  which  occurred  most 
typically  in  the  higher  ambient-noise  conditions,  in¬ 
volves  an  error  on  the  part  of  both  the  listener  and  the 
talker.  The  talker’s  error  is  that  he  failed  to  detect  the 
listener  error  and  therefore  failed  to  correct  it  (TOE  or 
Talker  Omission  Error).  Sequence  5  is  purely  a  talker 
error:  He  thought  the  listener’s  correct  response  was 
incorrect  and  repeated  a  word  he  need  not  have  re¬ 
pealed  (TCE  or  Talker  Commission  Error).  This  type 
of  error  sometimes  sets  off  a  longer  chain  of  errors  like 
sequences  7  and  8  where  the  listener,  because  of  the 
error  of  the  talker,  assumed  he  was  wrong  and  changed 
his  initially  correct  response  to  one  (7)  or  more  (8) 
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Fig.  2.  Speech  level  vs  noise  level  of  the  center  pair  only.  7  he 
midpoint  along  the  abscissa  represents  the  quietest  condition,  a 
single  talker-listener  pair  in  the  quiet  recording  studio.  To  the 
right  of  center,  thermal  noise  was  added  to  make  ambient  room 
noise  levels  at  65,  75,  and  85  dll  (C  scale).  To  the  left  the  nund>er 
of  t  ornmunicating  [rairs  was  successively  doubled  by  the  addition 
of  1.  2,  and  1  pairs  of  talker-listeners.  The  parameters  on  the 
left  hand  side  of  the  figure  represent  greater  noise  levels,  and  on 
the  right,  more  people.  The  same  data  are  plotted  on  the  left  and 
right  halves.  The  two  halves  differ  only  in  that  the  abscissa  and 
parameter  designations  arc  interchanged. 

The  datum  points  for  “no  additional  pairs"  and  “plus  1  pair" 
represent  results  from  8  different  lists  of  50  PB  words.  Four  lists 
are  represented  for  “plus  2  pairs"  and  two  lists  for  “plus  4  pairs." 

incorrect  responses.  Sequence  6  is  ini  ‘Jier,  though  rare, 
combination  of  listener  and  talker  errors.  In  plotting 
resuits,  all  three  types  of  errors  were  combined  into  a 
single  error  score,  i.e.,  E=I«E+TOE-i-TCE. 

The  number  of  repeats  is  ordinarily  an  important 
measure  of  t  emmunications  success.  However,  since 
the  lack  of  a  repeat,  .equence  4,  is  as  serious  as  the 
presence  of  -i  repeat,  the  total  number  of  repeats  is  not 
a  good  performance  measure  on  this  task.  The  in¬ 
formation  on  repeats,  or  lack  of  them,  is  contained  in 
the  combined  error  score. 

R5S0LTS 
Speech  Level 

When  avenged  over  all  three  pairs,  the  speech  levels 
of  the  listeners  did  not  differ  from  the  speech  levels  of 
the  talkers.  Therefore  in  the  results  the  speech  levels 
of  the  talkers  and  the  listeners  are  averaged. 

Figure  2  shows  how  the  speech  level  of  the  center 
talker-listener  pair  changed  with  ambient  thermal  noise 
level  and  number  of  adjacent  competing  talker-listener 
pairs.  Median  speech  level  is  plotted  on  the  ordinate 
and  along  the  abscissa  are  measures,  direct  or  indirect, 
of  noise  ieveis.  The  speech  icvels  on  the  ordinate  are 
levels  measured  approximately  !  in.  from  the  talker’s 
mouth  (at  the  listener’s  car,  36  in.  away,  the  measured 
level  was  IS  dB  less).  The  minimum  speech  level  of 
94  dB  (center  pair  alone  in  the  quiet)  corresj»onds  to 
“normal”  speaking  effort  and  the  maximum  of  117  dB 
(“plus  4  pairs'*  and  “85  dB”)  to  a  “shout.”  The  true 
long-term  rms  speech  level  at  1  m  in  a  free  field  would 
be  34  dB  less  than  the  94  dB  (normal)  and  117  dB 
(shout)  values  shown  (31  dB  for  distance  doubled  and 


3  dB  for  peak  to  rms  correction  of  the  sound-level 
meter  reading). 

From  the  center  to  the  right  along  the  abscissa  the 
ambient  room  noise  increases  in  10-dB  steps.  The  break 
in  the  abscissa  reflects  the  fact  that,  although  the  C 
scale  level  for  the  “quiet”  Q  condition  is  10  dB  below 
the  65-dB  thermal-noise  level,  the  difference  in  A-scale 
levels  is  about  30  dB.  The  A  level  is  more  closely 
related  to  speech  interference  than  is  the  C  level.-*  The 
parameters  on  the  curves  to  the  right  are  the  number 
of  additional  pairs  communicating:  The  curves  are  (1) 
the  center  pair  by  themselves,  (J)  with  1  additional 
pair  alongside,  (3)  with  2  additional  pairs  (one  on  each 
side  of  the  center  pair),  and  (4)  with  4  additional  pairs 
(two  pairs  on  each  side  of  the  center  pair). 

From  the  center  to  the  left  the  number  of  pairs  of 
communicators  around  the  center  pair  doubles  and  the 
parameters  on  the  curves  are  the  ambient-noise  levels. 
The  same  data  are  plotted  on  the  left  as  on  the  right. 
The  difference  is  that  the  abscissas  and  parameters  are 
interchanged. 

This  symmetrical  plotting  of  the  data  is  done  to 
emphasize  the  comparison  between  the  effects  on  sjieech 
level  of  “noise  ievel”  and  “number  of  communicators.” 
The  curves  are  within  ±2  dB  of  being  bilaterally 
symmetrical  around  the  center.  This  implies  a  general 
equivalence  between  (1)  adding  10  dB  of  noise  and 
(2)  doubling  the  number  of  people.  Adding  lOdBof  noise 
essentially  doubles  the  subjective  loudness.4  If  speech 
levels  of  talkers  surrounded  by  noises  be  considered  an 
indicator  of  “loudness”  then  doubling  the  number  of 
communicators  likewise  doubles  the  loudness.  In  the 
lower  ambient-noise  levels  the  amount  of  speech-level 
shift  with  10  dB  of  noise,  or  doubling  of  communicators, 
is  roughly  5  dB,  In  the  highest  ambient-noise  conditions, 
speech  level  is  approaching  maximum  possible  sus- 


Fic.  3.  Speech  level  vs  noise  level  of  the  3  data-yielding  pairs 
(all).  The  or  natc,  abscissa,  and  parameters  have  the  same 
meaning  as  in  Pig.  2.  Each  datum  point  on  the  “no  additional 
pairs”  represent:,  24  lists,  nn  the  “plus  1  pair”  16  lists,  on  the 
“plus  2  pairs”  12  iLrts,  an.i  on  the  “plus  4  pairs"  6  lists. 


*  R-  G._  Kjumpp  and  J.  C.  Webster,  "  jpecch  Interference 
Aspects  of  Navy  Noises,"  U.  S.  Navy  Eh-ctronics  Laboratory 
Tech.  Rept.  (to  be  published). 

*5>.  S.  -Stevens,  J.  Acou-t.  Soc.  Am.  28,  807-8-2  (1956).  See 
Fig.  19. 
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tained  level5  * :  l’li  vsiologii  al  limitations  keep  the  spew  h 
level  from  continuing  to  increase  linearly. 

An  increase  in  ambient  thermal  noise  alTeits  all 
talkers  in  the  room  equally,  hut  adding  people  sym¬ 
metrically  around  the  center  pair  affects  the  center 
pair  more  (or  at  least  differently)  than  any  of  the  other 
pairs.  In  Fig.  3  the  speech  levels  of  all  3  pairs  are  con¬ 
sidered.  The  same  general  trends  are  evident  in  Fig. 
3  as  in  Fig.  2 ;  i.e.,  speech  level  increases  with  increasing 
ambient  noise  or  additional  adjacent  talkers.  Two  kinds 
of  dilTerences  show  up.  First,  the  inherent  speech  levels 
of  the  two  noncentral  pairs  are  less  than  the  center 
jKiir,  so  with  “no  additional  pairs”  the  speech  level  is 
down  about  4  dB  on  Fig.  3. 

The  second  difference  has  to  do  with  geometry  of  the 
test  situation.  When  alone  or  with  one  additional  pair, 
the  center  pair  and  the  two  noncentral  pairs  are  in 
essentially  the  same  test  situation.  However,  with  two 
additional  pairs  the  center  pair  has  a  pair  on  each  side 
while  each  noncentral  pair  has  the  additional  people  on 
one  side  of  them  only.  To  the  noncentral  pairs,  the 
“plus  2  pairs”  condition  is  more  like  "plus  one  and  a 
fraction.”  The  results  reflect  this  inasmuch  as  the  “plus 
2  pairs”  data  on  Fig.  3  lie  between  the  plus  1  and  the 
plus  2  of  Fig.  2.  Likewise  with  four  additional  pairs  ir. 
the  room  the  center  pair  is  surrounded  by  two  pairs 
on  each  side  whereas  the  noncentrai  (but  nonend)  pairs 
have  three  pairs  on  one  side  of  them  and  one  pair  on  the 
other.  The  “plus  2  pairs”  condition  for  the  center  pair 
is  more  equivalent  to  the  “plus  4  pairs”  for  the  non- 
central  pairs,  and  the  data  reflect  this.  If  the  left-hand 


Fig.  4.  Voice  sjiectra  of  the  talker  from  the  center  pair  at 
various  noise  conditions.  Over-all  (broad-band)  levels  are  shown 
for  each  condition  at  the  extreme  left.  These  over-alls  represent 
the  talker  portion  of  the  data  plotted  on  Fig.  2  at  the  specified 
(larametric  points. 

1  j.  M.  Pickett,  J.  Acoust.  Soc.  Am.  2S,  902-905  (1956).  See 

rig.  i. 

•Western  Electro-Acoustics  1-ab.  Final  Rept.  Contract  At 
33(616)-3710  Task  No.  43060,  “Study  and  Investigation  of 
Specialized  Electro-Acoustic  Transducers  for  Voice  Communica¬ 
tion  in  Aire  rail."  Appendices  1-6,  (February  1959).  See  Fig. 
A4  9. 
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Fig.  5.  Rate  in  words  per  second  to  read  10  error-free  words  as 
a  function  of  ambient  noise  and  number  of  communicators. 
Abscissa  and  parameters  have  same  meaning  as  in  Figs.  2  and  3, 
and  the  data  are  liased  on  the  same  number  of  observations  as  in 
Fig.  2. 

abscissa  of  Fig.  3  (and  the  right-hand  parameters) 
were  relabeled  as  “no  additional  pairs,”  “plus  1,” 
“plus  1+,”  and  “plus  2+,”  the  results  on  Fig.  3  would 
(except  for  the  inherent  weaker  s;>eech  levels  alone) 
very  nearly  coincide  with  the  results  shown  on  Fig.  2 ; 
the  agreement  is  within  2  dB. 

Figures  2  and  3  show  how  the  over-all  speech  level 
increases  with  noise,  but  what  happens  to  the  voice 
spectrum  as  the  over-all  speech  level  increases?  The 
voice  spectrum  of  the  talker  of  the  center  pair  was 
analyzed  in  c.etail  from  selected  samples  of  seven  of 
the  different  experimental  conditions  shown  in  Fig.  2. 
Figure  4  shows  the  level  in  half-octave  bands  under 
the  conditions  of  Quiet  and  at  65,  75,  and  85  dB  of 
ambient  noise,  and  in  the  75-dB  level  with  1,  2,  and  4 
pairs  of  communicators  around  him.  His  over-all  level 
increases  in  order  through  each  of  the  conditions  just 
listed  (see  Fig.  2)  except  for  the  “no  additional  pairs” 
at  85  dB  vs  the  “plus  one“  at  75  dB,  which  are  roughly 
equal.  Note  that  as  the  over-all  level  increases  more 
and  more  energy  is  shifted  to  the  higher  frequencies.7 
The  two  levels  which  in  over-all  level  are  roughly  equal 
(alone  at  85  and  1  pair  at  75)  have  nearly  identical 
spectra.  From  these  selected  samples  it  appears  that 
spectrum  is  more  dependent  on  level  than  upon  what 
causes  the  level  to  be  where  it  is  (i.e.,  it  makes  little 
difference  whether  it  is  thermal  amaient  noise  or 
fellow  communicating  colleagues  that  cause  increased 
speech  level ;  for  equal  speech  level  the  spectrum  is  the 
same). 

Rate 

Figure  5  shows  the  same  type  of  plot  as  Fig.  2 
except  that  rate  (words  per  second)  is  plotted  on  the 
ordinate.  N’otc  the  complete  lack  of  symmetry  as  con- 

1  J.  C.  R.  Licklidcr,  M.  E.  Hswlev,  R.  A  Walkling.  J.  Acnu^t. 
Soc.  Am.  27,  207(A)  (1955). 
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Fic.  6.  Errors  of  the  center  pair  as  a  function  of  ambient  room 
noise  and  number  of  competing  communicators.  Abscissa  and 
parameters  as  in  Figs.  2, 3,  and  5.  Data  are  based  on  same  number 
of  observations  as  in  Fig.  2. 

trusted  to  Fig.  2.  The  rate  of  utterance  of  the  center 
pair  depends  to  a  greater  degree  on  ambient  noise  than 
on  talker-generated  noise.  Contrary  to  the  speech-level 
results,  increasing  the  ambient  noise  decreases  the  rate 
of  talking.  In  general,  increasing  the  number  of  people 
increases  the  rate  of  talking,  but  there  are  strong  in¬ 
teractions  with  room  noise.  The  rate  data  for  “all” 
communicators  are  not  shown  inasmuch  as  they  do  not 
show  any  systematic  deviation  from  the  "center”  data. 

Errors 

Figure  6  shows  tlte  percentage  of  errors  made  L\  the 
center  pair.  In  low  noise  levels  the  addition  of  com¬ 
municators  contributes  greatly  to  the  number  of  errors. 
Likewise  with  few  communicators  in  the  room  addi¬ 
tional  noise  above  75  dB  contributes  greatly  to  errors. 
The  increase  from  65  to  75  dB  of  noise  does  not  in 
general  increase  the  number  of  errors. 

The  '‘all”  data  on  errors  (not  shown)  are  generally 
equivalent  to  the  "center”  data  except  that  in  general 
fewer  errors  were  made  by  the  noncentral  pairs.  It  must 
be  remembered  that  this  experiment  was  designed  to 
test  speech  level  ns  a  function  of  noise,  error  rates  kept 
minimal.  The  talkt  rs  were  instruc  ted  to  keep  errors  to 
less  than  10%  and  actually  achieved  a  6%  error  rate 
on  the  average.  It  is  not  loo  surprising  therefore  that 
the  analyses  of  errors  show  no  systematic  trends. 

DISCUSSION 

The  increase  in  speech  level  of  5  dB  for  each  10-dB 
increase  of  ambient  thermal  noise  (slope  0.5)  is  some¬ 
what  higher  than  the  change  in  speech  level  with  noise- 
found  by  Krvter,*  by  Hanley  and  Steer,*  by  Korn,16 


*  K.  D.  Kry-cr,  j.  Acoust.  Soc.  Am.  1-S,  413  (19461. 

*  T.  D.  Hanlev  and  M.  D.  Steer,  J.  Speech  Hearing  Disorder* 
14.  363  (1949\  ' 

’•  T.  S.  Korn,  J.  Acoust.  Soc.  Am.  26.  793  (1934). 


and  by  Pickett11  (slope=0.3).  Although  factors  such  as 
the  speech  material,  room  acoustics,  distance  between 
talker  and  listener,  spectrum  of  ambient  noise,  and 
absolute  ambient-noise  level  will  affect  the  speech  level, 
differences  in  siopcs  obtained  are  probably  due  pri¬ 
marily  to  differences  in  feedback  provided  to  the  talker 
and  degree  of  accuracy  required.  In  the  Kryter  and 
Pickett  studies  the  listener  wrote  down  what  the  talker 
said  and  correctness  of  response  was  ascertained  later. 
In  the  Korn  and  in  the  Hanley  and  Steer  studies  there 
was  no  objective  measure  of  communication  effective¬ 
ness  available  to  the  talker.  In  contrast,  in  the  present 
study  the  talker  knew  immediately  after  each  utterance 
how  well  his  speech  was  being  received  and  the  error 
rate  averaged  6%.  This  difference  in  accuracy  and  im¬ 
mediacy  of  feedback  could  account  for  the  greater  slope- 
found  in  the  present  experiment. 

In  this  study  the  number  of  error-  (and  repeat-)  tree 
words  spoken  decreased  with  increasing  ambient  noise. 
This  tendency  decreased  progressively  .is  more  com¬ 
municators  entered  into  the  talking  task.  Hanley  and 
Steer1*  obtained  similar  results  inasmuch  as  increasing 
noise  in  the  headphones  of  talkers  caused  the  speech 
level  to  increase,  the  words  sjioken  per  minute  to  de¬ 
crease,  and  the  mean  syllable  duration  to  increase.  On 
the  other  hand,  Black13  found  that  when  talkers  lucre 
instructed  to  read  at  four  vocal-effort  levels  ranging  from 
soft  to  loud  that  speaking  rate  increased.  Black’s  re¬ 
sults  do  agree,  however,  that  speech  level  and  the 
fundamental  voice  frequency  increased  as  talkers  spoke 
louder  and  louder.  Black’s  talkers  were  not  raising  their 
speech  level  to  combat  noise  and  retain  error-free 
communication  but  only  u-xm  instructions  to  do  so. 

When  Peters14  informed  his  talkers  they  were  not 
being  understood  they  increased  their  sj>cet  h  level  and 
decreased  their  rate  (but  did  not  become  more  intel¬ 
ligible).  It  would  appear  that  speaking  rate  does  not 
always  decrease  as  speech  lcvei  increases.  This  is  true- 
only  when  speech  level  increases  because  of  combatting 
noise  or  unintclsigibility.  When  asked  to  increase  speech 
level  in  a  quiet  environment  and  with  no  communica¬ 
tions  to  maintain,  the  rate  and  level  apparently  increase 
concomitantly.  This  lack  of  direct  relationship  be¬ 
tween  sjK-ech  level  and  rate  may  be  the  reason  why  in 
this  study  rate  decreases  with  increasing  ambient  noise 
but  has  a  tendency  to  increase  when  additional  com¬ 
municators  are  talking  (unless  too  many  are  talking). 


11  J.  M.  Pickett,  j.  Acoust.  Soc.  Am.  30,  278  (!95S>. 

,rT.  D.  Hanlev  and  M.  D.  Steer,  “Effect  of  Level  of  Dis¬ 
tracting  Noise  upon  Speaking  Rate,  Duration,  and  Intensity, " 
Spec.  Devices  Center  Tech.  Rept.  No.  SDC  104-2-14  (June  1949). 

11  J.  \V_  Black.  "A  Relationship  among  Fundamental  Fre¬ 
quency.  Vocal  Sound  Pressure,  and  Rate  of  Speaking.”  Ohio 
State  I'nivcrsitv  Research  Poundage--  and  l*.  S.  Nava!  School 
of  Aviation  Medicine.  Rept.  77  (August  19SS). 

s*  R.  W.  Peters.  "Changes  in  Voice  Intelligibility.  Sound  Pres¬ 
sure  1-cvcJ  of  Resjjonse.  and  Duration  of  Response  as  a  Function 
of  the  Speaker’s  Being  Rcj.vatedly  Informed  that  He  is  not  Being 
Understood  by  His  listeners.”  U.  S.  Nava!  School  of  Aviation 
Medicine  Progr.  Rept.  No.  50  (May  1955;. 


941 


F  A  C  E  -  r  O  -  F  A  C  E  COMM  V  X  I  C  A  T  1  O  X  i  X  X  O  1  S  E 


CONCLUSIONS  their  speech  level  5  dB  if  the  number  of  similarly 

If  a  communicating  pair  has  direct  spoken  feedback  instructed  communicators  around  them  successively 
and  is  required  to  maintain  relatively  error-free  com-  doubles.  'I  heir  rate  of  word  delivery  decreases  with 
munications,  then  for  each  im  rease  of  10  dB  of  thermal  increasing  noise  but  lends  to  increase  with  increasing 
noise  they  raise  their  speech  level  5  uts.  They  also  raise  people  (until  too  many  people  are  added). 
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THE  EFFECT  OF  TALKER-LISTENER  ANGLE 
ON  WORD  INTELLIGIBILITY 

by  P.  O.  Thompson  and  J.  C.  Webster 
U.S.  '  aVV  Elfrimnics  Laboratory.  San  Diego.  California 


Summary 

A  scries  of  experiments  were  conducted  in  a  sound-treated  studio  to  study  the  effects  of 
talker  angle  and  listener  angle  on  speech  intelligibility.  Background  noise  was  introduced 
through  two  loudspeakers  to  control  the  general  intelligibility  level  in  the  vicinity  of  the 
listeners.  The  noise  also  served  to  neutralize  the  effects  of  speech  reflections  in  the  room. 
Seventy  word-tests  were  run  in  three  sub-experiments.  T^e  talker  read  a  list  of  fifty  C-Y-C 
words  for  each  test.  The  results  indicated  that  speech  intelligibility  varies  more  with  listener 
angle  than  with  talker  angle,  at  least  within  45  s  on  either  side  of  the  talker.  However,  the 
relationship  between  intelligibility  and  listening  angle  may  have  been  influenced  by  the 
directionality  of  the  noise  source  relative  to  the  talker  and  the  listeners. 

Zusammeufassung 

Einc  Reihc  von  Untcrsuehungcn  warden  in  eincm  schallgedampften  Tonstudio  durdi- 
gefuhrt,  um  die  Eintiiisse  von  Sprechcrwinke!  und  Horcrwinkel  auf  die  Sprachverstandlich- 
keit  zu  uhtersuchen.  Um  den  durdischnittlichen  Verstandlichkcitspegcl  in  dcr  IS'ahc  der 
Horer  zu  rcgulieren.  wurde  Hintcrgnindgcrausch  mit’.cls  zweicr  Lautsprccher  in  das  Ton- 
studio  cingefuhrt.  Dieses  Gcrausch  '.eutralisierte  auch  den  EinfluB  von  raumlichin  Sprach- 
riickstrahlungc-n.  Siebzig  Wortproben  wurden  in  drei  Nebcnuntcrsuchur.gen  durdigefuhrt. 
in  welchen  der  Sprechcr  jewcils  cine  Reihc  von  fiintzig  cinsilbigen  Wortcrn  fur  jedc 
Prcbc  las. 

Die  Ergebnisse  zeigten,  daB  die  Sprachverstandlichkcit  einzelncr  Worter  mehr  von  deni 
Horcrwinkel  als  von  dem  Sprecherwinke!  abhangig  war.  zumindcst  innerhalb  von  45° 
heiderseitig  dcs  Sprecjers.  Dennoch  ist  es  moglich.  daB  des  VcrhSltnis  von  Verstandiichkeit 
zu  Horcrwinkel  durch  die  Richtung  der  Rauschquelle  hezogen  auf  Sprechcr  und  Hoier  hitte 
beeiafluBt  werden  konnen. 

Sommai  re 

On  a  procede  a  une  serie  d’cxpcriences  dans  un  studio  acoustiquc  pour  eludier  les  effets 
de  I’anglc  d'ecoute  et  de  Tangle  de  la  j  ale  sur  Tintclligibilitc.  Un  bruit  de  fond  etait 
enveye  sur  deux  hact-parleurs  de  fajon  a  pauvoir  controlcr  le  niveau  general  d’intelligibilite 
dans  1c  vvisinage  des  ecouteurs;  le  bruit  servait  cgalement  a  neutraliser  les  effets  de  re¬ 
flexion  dsns  la  sallc.  Dans  trois  experiences  primaircs.  on  se  servit  de  soixante  dix  mots 
«test*.  Pour  diaque  essai,  le  «parlcun»  lisait  cinquantc  mots  C.V.C.  Les  rcsultats  ont  montre 
que  Tintclligibilitc  variait  plus  avec  Tangle  de  direction  de  «Tecoutcun>  que  celui  du  «par- 
lcur»,  pour  des  variations  d'angle  du  « pari  carp  allant  jusqu’a  45“  ac  chaque  cole.  Quoiqu’il 
eti  soil,  la  relation  entre  Tintclligibilite  et  Tangle  d’ecoute  peut  avoir  etc  influcncce  par  la 
dircctivite  de  la  source  sonorc  comparativcmcnt  au  «parlcur»  et  a  oTeoouteur*. 


1.  Introduction 

Recently  Chalcpova  and  Slavik  [1J,  working 
with  listeners  who  vve-rc  stationed  in  a  rank  ana  tile 
formation,  found  that  those  who  were  directly  in 
front  of  the  talker  often  scored  lower  than  those 
who  were  slightly  to  the  side,  even  out  to  an  angle 
of  45  .  Although  the  cause  of  this  effect  could  have 
been  the  listening  angle  (the  relative  direction  the 
listener  was  facing),  the  experimenters  suggested  it 
might  have  been  due  to  the  directional  character¬ 
istics  of  the  speech  or.  in  other  words,  the  talker 
angie.  Since  they  did  not  independently  control 


eithei  variable,  they  could  not  specify  the  respon¬ 
sible  agent. 

Because  of  the  curiosity  stirred  up  by  this  ques¬ 
tion  of  a  jmssiblc  gain  in  intelligibility  15°  to  45" 
off  the  speech  axis,  an  effort  was  made  to  coordinate 
work  on  this  question  with  a  research  program 
aimed  primarily  at  the  effects  of  noise  characteris¬ 
tics  and  talker  distance  on  word  intelligibility  and 
vocal  effort.  The  background  noises  required  bv  the 
program  would  mask  the-  indirect,  reflected  com¬ 
ponents  of  the  speech  signal  and  would  thereby 
make  possible  a  study  of  the  directional  character¬ 
istics  of  the  direct  component. 
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2.  Proccdare 

The  major  variables  in  the  experimental  program, 
into  which  the  listener  and  .talker  angle  sub-variables 
were  woven,  were  word  intelligibility  vs.  distance 
between  talker  and  listener,  and  type  of  ambient 
noise.  By  counterbalancing  the  location  of  the 
listeners  around  the  talker,  the  direction  the  listen¬ 
ers  faced,  and  the  direction  the  talker  faced,  it  was 
possible  to  get  the  major  data  of  distance  and  noise 
type  and  still  extract  data  on  listener  and  talker 
angle. 

Three  experiments  were  conducted:  The  first 
dealing  primarily  with  the  effects  of  distance  be¬ 
tween  talker  and  listener  (1.8,  3.6,  and  0.9  m),  the 
second  primarily  on  talker  and  listener  ingle,  and 
the  third  on  the  type  of  background  noise  used.  All 
three  experiments  were  conducted  in  a  sound-treated 
studio  that  was  especially  designed  for  recording 
speech.  Cylindrical  surfaces  were  purposely  design¬ 
ed  into  the  room  to  achieve  uniform  diffusion  of 
reflected  sound.  A  plan  view  of  the  teat  room,  in¬ 
cluding  the  layout  c'  noise  sources  (loudspeakers), 
and  the  general  positions  of  talker  and  listeners  is 
presented  in  Fig.  1.  The  loudspeakers  were  position¬ 
ed  such  that  the  masking  noise  gave  the  most  uni¬ 
form  coverage  over  all  listening  positions. 


Fig.  1.  Plan  view  of  the  arrangement  of  the  talker 
locations  and  listener  locations  (the  sets  of  four 
0.9's,  1.8's  and  3.6's  near  the  top  in  each  case 
in  combination  with  the  “L”,  which  was  the 
listener  location  common  to  all  three  arrange¬ 
ments).  The  angular  limits  of  the  three  listen¬ 
ing  formations  (dashed)  and  the  direction  the 
talker  generally  faced  (solid)  are  indicated  at 
the  talker  locations.  In  the  third  experiment  the 
whole  1-8  m  talker-listener  complex  was  rotated 
15°  either  to  the  left  or  to  the  right  for  some 
of  the  tests. 


Rhyme  word  lists  were  used  for  all  tests  [2].  Each 
list  is  composed  of  50  C-V-C  words.  Rhyme  words 
retain  the  same  vowels  and  final  consonants  from 
list  to  list  but  the  initial  consonants  change  from 
list  to  list. 

2.1.  Experiment  / 

In  this  experiment  the  distance  between  the  talker 
and  the  listeners  was  systematically  varied  from  1.8 
to  3.6,  to  0.9  m.  At  all  three  distances.  (1)  the 
listeners  faced  the  talker  half  the  time  and  ISO1' 
away  from  him  half  the  time,  (2)  each  listener  ad¬ 
vanced  counterclockwise  to  the  next  chair  after  each 
test  word  list,  and  (3)  jet  aircraft  noise  provided 
the  masking  background  for  listening. 

At  the  0.9  m  distance  the  talker  faced  the  center 
listener.  The  remaining  four  listeners  were  spaced 
on  an  arc  symmetrically  around  the  center  listener 
at  ±35°  and  ±70°  (see  Fig.  1).  The  jet  noise 
sound-pressure-level  (SPL)  varied  from  64  through 
76,  88,  100.  to  106  dB(C)  for  each  pair  of  word 
lists  (half  with  listeners  facing  the  talker  and  half 
with  listeners  facing  away  from  the  talker). 

At  the  1.8  m  distance  the  talker  faced  the  most 
clockwise  listener  position.  The  remaining  four 
listeners  were  spaced  15°  apart  along  the  arc  of 
1.8  m  radius  (see  Fig.  1).  The  jet  noise  levels  were 

r j  "4  OO  n<  ...  f  *aa 
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At  the  3.6  m  distance  the  talker  again  faced  the 
central  listening  position.  But  the  other  listeners  were 
not  located  symmetrically  around  the  central  posi¬ 
tion  but  at  ±  15°,  —30°.  and  —45°  (see  Fig.  1), 
The  jet  noise  levels  were  64,  70,  76,  82.  and 
88  dB(C). 

Experiment  I  included  two  listener  angles  (fac¬ 
ing)  :  0°  and  180°.  and  many  talker  angles  0°  and 
angles  in  steps  of  15°  to  60°.  and  in  steps  of  35° 
to  70°.  The  off-centering  of  the  talker  ( 1.8  m  dis¬ 
tance)  and  the  listeners  (3.6  in  distance)  was  an 
attempt  to  balance  out  the  geometry  of  the  room- 
loudspeakei-chair  locations  and  arrive  at  true  talker- 
angle  functions.  The  different  noise  levels  at  dif¬ 
ferent  talker-listener  distances  was  an  attempt  to  get 
roughly  equivalent  test  scores  for  the  various  dis¬ 
tances. 

2.2.  Experiment  11 

Experiment  1:  was  divided  into  two  parts  of  five 
tests  each.  For  both  parts  the  noise  background  was 
88  dB  jet  noise  and  the  talker-listener  distance  was 
1.8  m.  In  the  first  part  the  listening  angle  was  varied 
from  one  test  to  the  next,  from  through  30r.  60" . 
and  90s.  to  180J.  The  talker  faced  the  listener 
farthest  clockwise  (0W  at  the  1.8  m  position  in  Fig.  1) 
for  all  five  tests.  Each  listener  moved  one  place 
counterclockwise  after  every  test. 
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in  the  second  part  the  listeners  did  not  change 
positions  at  all  and  always  fa«ed  the  talker.  The 
talker  fared  the  listener  farthest  <  lock  wise  for  the 
first  test  and  on  each  succeeding  test  he  retailed  13 
to  the  left  t <o  lace  a  different  listener.  In  this  way 
talker  angle  was  less  dependent  u|»on  chair  location. 

2.3.  Experiment  111 

In  experiment  Ill  a  set  of  five  tests  was  run  for 
each  of  six  different  background  noises  at  a  talker- 
listener  distance  of  1.8  m.  However,  for  the  pur¬ 
poses  of  this  report  the  difference  in  noise  back¬ 
ground  is  irrelevant  and  will  not  be  discussed  fur¬ 
ther. 

In  addition  to  the  fixe  listener  positions  shown  in 
the  1.8  in  arc  of  Fig.  1.  an  additional  one  was  added 
on  each  side.  15'  beyond  the  outside  ones  indicated 
in  the  figure.  For  two  sets  of  five  tests  each,  the 
listeners  sat  in  the  five  chairs  farthest  clockwise,  for 
another  txvo  sets  they  sat  in  the  five  regular,  interior 
positions,  and  for  the  other  two  sets  they  sat  in  the 
five  positions  farthest  counterclockwise.  The  use  of 
these  seven  listener  positions  made  talker  angle  less 
dependent  upon  specific  single  positions.  In  each 
pair  of  sets  the  talker  faced  the  listener  farthest  to 
his  left  for  or.c  set  and  the  listener  farthest  to  his 
right  for  the  other  set.  In  every  set  of  five  tests  the 
listeners  faced  ihc  iulkcr  on  the  first  test  olid  then 
30'.  60  .  90  .  and  180".  respectively,  from  him 
on  succeeding  tests.  After  every  test  the  listeners 
also  exchanged  seats  in  a  counterclockwise  direction. 

3.  Results 

3.1.  Experiment  1 

The  results,  corrected  to  give  the  data  for  each 
talker-listener  distance  the  same  mean,  are  plotted 


Fig.  2.  W  ord  intelligibility  result*  of  experiment  I  as  a 
function  of  talker  angle  tshe  angle  between  the 
direction  the  talker  faced  and  the  location  of 
the  li*4cncr).  The  parameter  is  the  distance 
hriwcro  the  talker  and  the  listeners.  The  talker 
was  immobile,  x*  that  talker  angle  was  not  in¬ 
dependent  of  the  location  of  the  listener's 
chairs.  The  listeners  moxed  one  position  coonter- 
ciock-wbe  after  every  list,  facing  toward  the 
talker  .  listening  angle  0  t  on  half  the  lists  and 
ISO  awax  on  the  »Uwr  half 


in  Fig.  2  with  talker  angle  for  various  distances  as 
the  parameter.  The  0.9  m  data  show  a  drop  in 
intelligibility  as  the  talker  angle  increased  from 
zero,  while  the  longer  distance  data  show  an  in- 
<  rease.  at  least  out  to  30  .  The  loss  from  listening 
with  hacks  to  the  talker  was  not  as  great  as  might 
he  expected.  in  one  2-test  comparison  it  was  13.7 
l»er  cent  hut  otherwise  it  ranged  from  0  to  7  per 
cent. 

3.2.  Exf>eriment  II 

Fig.  3  shows  the  results  of  experiment  II.  Th~ 
listeners  J5C  and  30  from  the  direction  the  talker 
faced  scored  higher,  on  the  average-  than  the  listen¬ 
ers  directly  ahead  a*  0  .  Further,  the  listeners  scor¬ 
ed  higher  on  the  average,  when  they  did  not  face 
the  talker  directly  but  instead  faced  away  from  him 
by  30  and  60  .  90 ~  listening  was  about  equal  to 
0  listening,  and  ISO  listening  was  substantially 
worse. 


i  -i-  ■/  » 

Fig.  3.  Results  of  experiment  if.  The  listening  angle 
results  (A)  are  averaged  over  ali  talker  angels 
and  show  the  effect  of  rotating  at  a  fixed  listen¬ 
ing  location  through  30'.  6C  .  90s.  and  180'. 
The  talker  angle  result*  arc  of  two  types:  (!)  a 
particular  talker  angle  represented  by  a  parti¬ 
cular  listener  chair  or  Jocaiscs  with  talker  im¬ 
mobile  and  (2)  talker  angle  varied  by  rotation 
of  the  talker  between  lists,  facing  a  dificrest 
listener  kea lion  on  ever,  list  and  making  a 
specific  taiker  angle  independent  of  a  specific 
listener  location  in  the  room.  In  this  second  tvpe 
each  listener  remained  in  a  particular  chair 
throughout. 

■L?.  Experiment  III 

Fig.  4  shows  a  gain  in  intelligibility  at  oblierse 
listening  angles  similar  to  bnt  less  than  the  gains 
shown  in  the  experiment  I!  data  <Fig.3i.  and.  in 
contrast,  in  this  experiment  the  mean  score  v»r  lis¬ 
teners  directly  in  front  of  the  talker  is  as  high  as 
tor  any  other  angle,  i The  O'  and  36  mean  scores, 
arr  equivalent  tor  these  six  sets  of  Hsts.  i 
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Talker  or  listening  angle  — *•» 


Fig.  4.  Results  of  experiment  III.  The  talker  angle  data 
is  for  the  following  test  conditions:  the  talker 
faced  alternately  one  end  of  the  listening  forma¬ 
tion  or  the  other  for  each  group  of  five  lists, 
hut  was  immobile  within  such  a  group  of  lists, 
while  the  listeners  (between  lists)  were  exchang¬ 
ing  chairs.  The  whole  listening  arrangement  was 
shifted  left  15°  for  1/3  of  the  lists  and  right 
15°  for  another  1/3  of  the  lists  by  using  one 
additional  chair  on  each  end  of  the  regular  five- 
chair  formation.  The  listening  angle  data  was 
obtained  by  rotating  the  listeners  through  30°, 
60",  90°,  and  180°  from  list  to  list. 

4.  Discussion 

In  experiments  II  and  III  (Figs.  3  and  4)  where 
listening  angles  were  varied  extensively,  listening 
angles  of  30°,  60°,  and  90°  resulted  in  better  aver¬ 
age  scores  than  0°.  The  results  on  the  effect  of 
talker  angle  in  the  three  experiments  showed  a 
number  of  cases  of  highest  intelligibility  directly 
ahead  of  the  talker  as  well  as  a  number  of  cases 
in  which  it  was  highest  at  some  angle  within  the 
range  of  15°  to  45°  to  the  side.  In  the  first  two  ex¬ 
periments,  out  of  six  sets  of  five  tests  each  (exclud¬ 
ing  the  3.6  m  data  with  its  poorer  uniformity  of 
noise  in  the  listening  area) ,  four  turned  out  in  favor 
of  some  oblique  talker  angle.  In  experiment  III, 
however,  the  result  was  that  in  three  of  six  sets  of 
tests,  0°  talker  angle  resulted  in  a  higher  mean. 

An  “Analysis  of  Variance”  treatment  of  the  first 
series  of  tests  in  experiment  II  {the  only  set  sub¬ 
jected  to  such  analysis)  classified  the  differences 
found  on  the  listening-angle  variable  as  “very  signi¬ 
ficant.”  However,  the  effect  of  talker  angle,  which 
in  this  set  favored  15°  and  30°  quite  substantially, 
was  found  to  be  “non-significant”  by  this  analysis. 

The  listening-angle  effect  in  experiment  III  was 
somewhat  less  but  essentially  the  same. 

4.J.  Talker-angle  effects 

What  generalizations  can  be  made  concerning  the 
effect  of  talker  angle?  Starting  with  the  experiment  I 
data  for  the  0.9  m  talker-listener  distance  (Fig.  2) 


(which  because  of  highest  ratio  of  direct  speech 
component  to  indirect  components  may  be  the  most 
valid)  the  0C  talker  angle  yielded  the  highest  in¬ 
telligibility.  This  holds  true  regardless  of  whether 
the  listener  was  facing  frontwards  or  backwards.  In 
the  other  experiment  I  data  shown  in  Fig.  1,  oblique 
angles  yielded  highest  :ntelligibil«ty.  In  experiment  II 
(Fig.  3)  regardless  of  the  experimental  procedure, 
small  oblique  angles  yielded  the  highest  intelligi¬ 
bility.  In  experiment  III  (Fig.  4)  0°  talker  angle 
yielded  the  highest  intelligibility  again.  Experiment 
III  should  be  heavily  weighted  in  consideration  be¬ 
cause  it  represents  six  sets  of  five  tests  each  and  the 
effects  of  angle  of  talker-facing  and  orientation  in 
the  room  were  fairly  well  neutralized  by  counter¬ 
balancing.  In  this  experiment  0°  talker  angle  aver¬ 
aged  highest  in  intelligibility  with  the  30°  intelli¬ 
gibility  essentially  the  same. 

In  almost  every  case  considered  in  the  three  ex¬ 
periments  the  result  was  different,  sometimes  favor¬ 
ing  one  angle  and  sometimes  another.  Considering 
the  total  0.9  and  1.8  m  data  by  sets  (groups  of  five 
tests),  in  four  of  the  twelve  sets  intelligibility  was 
highest  at  0°;  in  another  three  sets  it  was  highest 
at  15°:  in  three  sets  it  was  highest  at  45°;  and  in 
two  sets  it  was  highest  at  30°.  On  the  basis  of  these 
results  it  appears  that  intelligibility  is  best  in  front 
of  the  talker  but  that  intelligibility  is  essentially 
equivalent  over  a  broad  arc  in  front  of  him. 

As  another  check  on  the  effect  of  talker  angle  the 
results  of  all  ten  sets  of  1.3  m  data  were  averaged 
and  the  means  for  talker  angles  of  from  0°  to  60° 
were  75,  76,  77,  74,  and  72  per  cent,  respectively. 
On  the  average,  there  was  a  trend  of  higher  intelli¬ 
gibility  at  slight  angles  off  the  speech  axis,  amount¬ 
ing  to  2  per  cent  at  30°.  This  is  tempered,  however, 
by  the  fact  that  for  the  average  of  six  of  the  ten 
component  sets  (all  of  experiment  HI),  the  0C-  and 
30'-mean  •.-cores  were  equivalent  and  in  three  of  the 
six  sets  of  tests  0°  talker  angle  yielded  the  highest 
score.  These  experirnen*  III  cores  are  corrected  for 
over-all  intelligibility  difference  from  chair  to  chair. 
Before  correction  me  intelligibility  advantage  of  0° 
v;as  prominent.  Corrections  reduced  this  by  2.5  per 
cent. 

4.2.  Noise  direction  problem 

The  noise  SPL  varied  no  more  than  i  1  dB  with¬ 
in  each  seating  arrangement,  except  in  the  case  of 
the  two  most  counterclockwise  listening  locations  in 
the  3.6  m  data.  The  superior  intelligibility  at  talker 
angles  of  30°  and  45°  in  the  3.6  r,r  data  can  he  at¬ 
tributed  ;o  this  factor.  The  next  question  is  whether 
noise  direction  characteristics  were  equivalent  from 
one  listener  location  io  another.  Subjectively  to  the 
listener  the  noise  source  was  the  loudspeaker  closest 
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to  him.  except  that  on  the  mill-line  between  the  loud¬ 
speakers  the  apparent  source  was  a  point  midway 
between  them.  A  survey  of  the  spatial  arrangements 
(Fig.  1)  reveals  that,  aside  from  the  obvious  in¬ 
equality  of  both  noise  level  and  direction  character¬ 
istics  in  the  two  3.6  m  locations  mentioned  above, 
the  other  listener  locations  were  essentially  equiva¬ 
lent  in  these  respects.  As  regards  noise  direction 
characteristics,  for  example,  the  apparent  source  of 
noise  for  all  five  listeners  was  quite  close  to  180° 
from  the  direction  of  the  talker.  This  similarity  sup¬ 
ports  confidence  in  the  validity  of  the  talker  angle 
results,  even  though  these  talker  angles  were  not 
wholly  independent  of  specific  locations  in  the  room. 

A  final  question  needing  attention  is  the  possible 
effects  of  the  noise  direction  charactei  istics  on  the 
listening  angle  variable.  According  to  the  findings 
of  Koch  [3].  when  speech  and  noise  are  ?ep<.  ‘“d 
by  exactly  180°  (re.  the  listener’s  location),  listen¬ 
ing  angles  of  0°  and  180°  worsen  by  up  to  12  dB 
the  minimum  detectable  (binaural)  threshold  for 
speech,  as  compared  to  listening  angles  such  as  45°, 
90°.  and  135^. 

It  seems  logical  that  the  directivity  effect  upon 
intelligibility  would  be  very  similar  to  the  effect 
upon  threshold  and  it  is  quite  possible  that  noise 
direction  characteristics  related  to  listening  angles 
of  0°  and  180°  in  this  study  handicapped  intelli¬ 
gibility  in  these  cases.  However,  there  is  a  possi¬ 
bility  that  Koch’s  findings  should  not  be  applied  to 
this  study  because  he  used  an  anechoic  room,  while 
the  present  study  was  conducted  in  a  fairly  reflective 
room.  (The  room  volume  of  this  voice-recording 
studio  is  about  100  m3  and  the  reverberation  time 
is  about  0.5  s  above,  and  about  0.3  s  below  1  kc/s.) 

Using  very  similar  methods  to  those  of  Koch, 
Hirsh  [4]  found  slightly  iess  effect  (9  dB)  for  the 
same  conditions,  and,  when  a  “highly  reverberant” 
room  was  used,  the  effect  was  almost  completely  de¬ 
stroyed.  In  the  reflective  room  the  average  differ¬ 
ence  in  thresholds  he  found  between  “Front-back” 
and  “Back-front”  versts  “Right-left”  (O'  and  180° 
versus  90°)  was  only  2  dB. 

The  room  Hirsh  used  was  probably  much  more 
reverberant  than  fhc  one  used  in  this  study  and 
the  effect  in  the  present  data  would  be  more  than  in 
die  very  reverberant  room  but  less  than  in  the 
anechoic  room.  An  agent  that  might  have  reduced 
ihis  artifact  in  the  present  study  is  the  fact  that  the 
noise  was  not  exactly  180°  for  all  listening  posi¬ 
tions  relative  to  the  talker,  thereby  giving  the  lis¬ 
tener  some  extra  benefit  from  binaural  localization 
cuts  (lateral  separation  of  speech  from  noise).  Both 
Hirsh  and  Koch  rigidly  controlled  source  angles, 
but  only  at  90°  intervals  in  the  case  of  Hirsh  and 
only  at  45°  intervals  in  the  case  of  Koch.  It  is  pos- 
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sible  that  5°  or  10°  offset  of  the  apparent  noise 
source  could  have  drastically  reduced  the  effect, 
adding  to  the  neutralization  caused  by  the  reverber¬ 
ation  characteristics  of  the  room. 

The  2  dB  effect  Hirsh  found  remaining  in  the 
reverberant  case  is  enough  to  account  for  the  rough¬ 
ly  5  per  cent  (words  correct)  improvement  at  listen¬ 
ing  angles  of  30c  and  60u  found  in  this  experiment. 
Therefore,  whether  the  effect  found  here  is  due  to 
pure  listening  angle  (dependent  only  upon  the  angu¬ 
lar  relationship  of  talker  and  listener)  or  is  an 
artifact  of  the  directional  characteristic  of  the  noise 
cannot  be  answered. 

5.  Summary  and  conclusions 

A  talker  read  70  lists  of  50  monosyllabic  words 
each  to  five  listeners  in  a  room  designed  for  the 
recording  of  speech.  The  listening  was  done  in  a 
noise  background.  The  variables  studied  were  talker 
angle,  listening  3ngle,  and  their  effect  upon  speech 
intelligibility.  The  angular  relationship  among  tal¬ 
ker,  listeners,  noise  sources,  and  room  were  varied 
extensively  to  isolate  the  effects  of  talker  angle  and 
listening  angle  from  most  of  the  other  factors. 

The  results  support  a  view  that  talker  angles  (at 
least)  within  the  range  between  0“  and  d:45°  do 
not  have  any  differential  effect  on  intelligibility. 
The  result  of  varying  listening  angle  away  from  0' 
to  30°  and  60°  was  higher  intelligibility.  This  re¬ 
sult,  however,  is  open  to  question  in  the  light  of  the 
findings  of  Hirsh  and  Keen,  which  suggest  that  with 
the  configuration  of  speech  source  and  noise  source 
used  in  the  present  experiments,  binaural  discrimi¬ 
nation  factors  substantially  handicap  intelligibility 
at  listening  angles  of  0°  (and  180°),  at  least  as 
compared  to  angles  from  45°  to  135°, 

Only  further  experiments  under  free-field  con¬ 
ditions  can  give  adequate  appraisal  of  the  effect  of 
(binaural)  listening  angle  on  intelligibility.  If  en¬ 
vironmental  noise  is  introduced  to  control  the  gener¬ 
al  level  of  intelligibility,  it  must  be  controlled  in 
such  ways  as  tc  neutralize  unwanted  binaural  dis¬ 
crimination  cues.  Or  the  experiments  could  be  con¬ 
ducted  in  open  air  so  that  the  direction  of  noise 
incidence  would  have  a  quasi-random  characteristic. 

(Rccdnd  16th.  1962.) 
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Summary 

Two  experiments  were-  conducted  to  study  the  effects  on  speech  intelligibility  of  talker 
angle  and  listener  angle  and  the  distance  between  talker  and  listener  under  free-ficld  con¬ 
ditions.  Some  findings  were  that  (1)  speech  intelligibility  falls  off  with  distance  in  a  manner 
consistent  with  a  3dB  per  distance  doubled  fall-off  in  speech  sound  pressure  level;  (2)  the 
intelligibility  in  a  broad  arc  from  —45°  to  +45°  in  front  of  the  talker  was  essentially 
equal;  (3)  the  effect  of  turning  the  listener  15°  to  75°  away  from  the  talk.-r  was  a  mean 
gain  of  about  4%  or  a  gain  equivalent  to  about  3.5  m  in  distance;  (4)  the  observed  direc¬ 
tional  aspect  of  intelligibility  agreed  quite  well  with  SPL  measurements  made  by  olbcr 
investigators  around  the  heads  of  a  model  and  a  human;  and  (5)  the  effects  of 
distance  and  wind  in  this  study  were  in  general  agreement  with  the  results  of  previous 
studies. 

Zusammenfassung 

Zwei  Ycrsuche  wurden  durchgefuhrt.  um  den  EinfiuB  lies  Sprecherwinkels.  des  Horer- 
winkels  und  des  Abstandcs  zwischen  Sprecher  und  Horer  auf  die  Sprachverstandlidikdt  im 
freien  Schallfeld  zu  untersuchen.  Unter  anderem  wurden  folgende  Beobaditungen  gemndil: 
(1)  Die  Spradivcrstandlichkeit  fallt  mil  dcr  Entfcrnung  in  einer  Wcisc,  die  mil  einer  Yer- 
ringerung  des  Sdialldrnckpegels  um  jcwcils  3  dB  bei  Verdoppclung  des  Abstandcs  ertlart 
werden  kann;  (2)  die  Yerstandlichkcit  war  uber  einen  breiler-  Bercich  von  etwa  —45°  bis 
—45“  vor  dem  Sprecher  im  wesentlicfcca  konstant:  (3)  Drchung  des  Horcrs  um  15''  bis 
75°  vom  Sprecher  fort  bra  chic  cine  Verbesserung  dcr  Yerstandlichkcit  tun  etwa  4%,  was 
einer  Verringerung  dcr  Entfcrnung  um  etwa  3.5  m  cntspridu;  (4)  die  beobadit-Jc  Rkh- 
iungsabhangigkcit  dcr  Yerstandlichkcit  stimmt  gut  fi herein  mit  Schalldruckpcgcinjcssungen 
anderer  AuJorcn  an  Modellen  und  Pcisonen;  (5)  der  EinfiuB  von  Entfcrnung  und  Wind 
war  bei  dieser  lintersuchung  wcilgchcnd  dcr  gleiche  wie  in  frfiherrn  Untersudiungen. 

Sommaire 

On  a  fah  deux  experiences  pour  ctuaicr  les  eficts  sur  Tintelligibilitc  de  la  parole,  de 
Tangle  d'ecoate.  de  Tangle  dc  parole,  el  de  la  distance  entre  aedileur  el  ^speaker*  dans 
Its  conditions  de  champ  libre.  Quelqucc  rcsultats  ont  montre  qcc: 

1)  TinUlligibilite  diminue  avee  la  distance,  oe  qui  sc  traduit  qualitativcrrcnt  par  za 
aifaibiissetnent  de  3  dB  du  niveau  so  tore  moves  par  distance  donblee. 

2)  Tintelligibilitc  reslc  la  mime  pour  ui  angle  iToavertare  allant  de  — 4>°  *  -45°  de  pan 
ct  d'autre  du  «speaker*. 

3)  lotsque  Ton  Icurcc  Taudilccr  de  !5C  a  75s  du  «  speaker*.  1c  gain  rooyen  est  d'enrinrn 
4S.  ce  qui  correspond  a  un  gain  en  distance  de  3.5  m. 

4)  TeSTet  directicnne]  de  Finielligibilite  dc  !a  parole  qcc  nous  avoirs  observe  est  parfaitc- 
roent  ea  accord  avec  les  mtsurcs  SPL  faiies  par  d'auues  chertheurs  operant  avee  la  tele 
dun  mannequin  ct  d'ua  h&rame. 

5)  les  efiets  de  distance  ct  de  vent  dans  celte  etude  seal  en  general  cn  accord  avee  Its 
rcsultats  des  etudes  prcsedcntes. 
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1.  Introduction 

Chall'POva  and  Slavik  [1]  recently  studied  in¬ 
telligibility  as  a  function  of  the  angular  relationship 
between  the  talker  and '  listeners  in  a  rectangular 
formation  of  r>ws  and  columns  in  an  open  held. 
Their  most  intere>‘.hig  finding  was  that  the  intelli¬ 
gibility  on  the  speech  axis  was  lower  than  it  was  10° 
to  45°  off  the  axis.  Their  experimental  design  did 
not  allow  them  to  determine  whether  this  was  caused 
by  directional  characteristics  of  the  voice  or  the 
directional  characteristics  of  the  external  ear.  Other 
studies  [2].  [3]  have  not  shown  a  lowering  of 
intelligibility  on  the  speech  axis,  hut  perhaps  this 
was  because  they  were  not  as  intensive  in  angular 
coverage,  number  of  listeners,  and- number  of  re¬ 
plications. 

In  order  to  chetk  the  Chalgpuva  and  Slavik  re¬ 
sults  two  experiments  were  conducted  on  an  open 
field  at  the  U.  5.  Naval  Training  Center  in  San 
Diego.  In  both  experiments  Navy  recruits  were  the 
listeners,  and  the  parade  grounds,  Preble  Field,  was 
the  open  field.  Preble  Field  is  305  m  long  by  137  m 
wide  and  is  surfaced  Vi  ith  asphalt  paving. 

To  study  the  dependence  upon  the  listener’ s  loca¬ 
tion  relative  to  the  talker’s  mouth,  in  experiment  I 
the  talker  rotated  through  a  series  of  angles  relative 
to  the  listeners.  To  study  the  dependence  upon  the 
direction  the  listener  was  facing  relative  to  the  tal¬ 
ker’ s  location,  the  listeners  in  both  experiments  were 
divided  so  that  at  any  time  half  of  them  faced  the 
talker  and  the  others  faced  away  from  him  by  small 
angles. 

2.  Procedure 

2.1  General 

The  intelligibility  tests  used  were  the  Faizbaxks 
Rhyme  Tests  [4J.  These  are  lists  of  50  monosyl¬ 
labic  words  (consonant-vowel -consonant)  such  as 

“1.  cop,  2.  wire. - .  50.  ten”.  There  are  five 

basis  tests  using  words  that  not  only  rhyme  from  test 
to  test,  but  ate  spelled  the  same  except  for  tbe  initial 
consonant.  For  instance,  a  second  test  might  be 

“1.  pop.  2.  sire. - 50.  den”,  or  “1.  top,  2.  wire, 

- .  50.  pen”,  etc.  The  answer  sheets  show  the 

vowel  and  final  consonant  of  each  word  (thus. 
“1.  —  op,  2. — ire. - ,50-  —  en”) . 

listeners  were  instructed  to  fill  in  the  initial  con¬ 
sonant  of  each  word  as  it  was  read,  even  if  they  had 
to  guess.  In  eadr  experiment  a  different  rhyme  word 
order  was  used  for  every  test  conducted-  Fourteen 
50- word  lists  were  used  in  cadi  experiment.  The 
listeners  were  thoroughly  instructed  in  the  testing 
procedure  and  were  given  a  practice  tesi. 


The  talker  for  these  experiments  was  the  first 
author,  who  has  a  resonant,  moderately  pitched 
voice,  is  somewhat  above  average  in  articulation, 
and  is  very  experienced  as  a  talker  for  intelligibility 
tests.  The  talker  monitored  his  speech  level  on  a 
small  sound-level  meier  mounted  0.5  m  in  front  of 
him  at  chest  height.  He  spoke  with  raised  voice, 
maintaining  an  average  85  dB  sound  pressure  level 
(SPL)  on  voice  peaks.  This  corresponds  to  an  rms 
SPL  of  75  dB  at  1  m.  The  word  “write”  was  read 
just  ahead  of  each  test  word,  both  to  act  as  a  “car¬ 
rier”  for  the  test  word  and  to  be  used  as  a  calibra¬ 
tion  word  for  level  monitoring.  One  test  word  was 
read  every  2  s  to  the  beat  of  an  electronic  interval 
timer.  When  extraneous  noises  were  of  such  a  nature 
as  to  interfere  with  speech  reception,  the  talker 
stopped  until  he  considered  the  noise  level  within 
acceptable  limits  again.  The  noises  that  caused  oc¬ 
casional  temporary  stoppages  were  from  aircraft 
flying  in  the  vicinity-  A  microphone  worn  around 


Fig.  1.  Talker-listener  locations  and  facings  ter  ex¬ 
periments  I  and  II.  In  experiment  1. 42  listeners 
sal  at  intersections  of  arts  A  through  F  and 
7  radial?  spaced  at  15s  intervals.  Ia  experiment 
II  listeners  also  sat  og  arc  C.  Ia  experiment  I. 
the  talker  (T)  faced  the  center  (0°)  radial  on 
tests  1.  ?,  13,  and  14,  the  —15s  radial  oss  tests 
3  and  4.  the  —30s  radial  ca  tests  5  and  6.  the 
—45°  radial  on  tests  7  aad  8.  —90'  oo  tests 
9  and  10.  aad  180°  oo  tests  11  and  12.  Ia  ex¬ 
periment  II  the  talker  always  faced  the  0s 
radiaL  In  both  experiments  listeners  in  arcs  B. 
D.  and  F"  faced  the  4-43®  radial  on  alt  odd- 
numbered  tests,  while  listeners  in  arcs  A.  C. 
aad  E  (and  C  ia  II)  faced  the  talker  location. 
The  listeners  reversed  their  direction  of  facing 
on  even-numbered  test*.  Le.  those  who  had 
faced  tbe  — 45c  radial  faced  the  talker,  and 
those  who  had  faced  the  talker  faced  the  -f  45® 
radiaL  Note  that  oo  the  —45®  radial,  faring 
the  -alkcr  was  equivalent  to  faring  the  -r-K° 
radiaL  Ia  hcah  experiment?  the  wind  direction 
varied  between  west  and  west-northwest:  in  ex¬ 
periment  I  wind  velocity  varied  between  27  to 
35  km/h  and  in  experiment  11  wind  velocity 
varied  between  0  aad  9  hrajh. 
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the  neck  by  the  talker  was  used  to  record  the  test 
sessions  for  later  review  of  speech  and  noise  levels. 

As  illustrated  in  Fig.  1.  the  listeners  sat  in  con* 
centric  arcs  2.3  m  apart,  starting  4.3  m  from  the 
talker.  Listeners  sat  behind  one  another  on  7  radials 
spaced  every  15°  from  —45°  to  -f45°  relative  to 
the  talker's  standard  direction  of  facing.  By  sitting* 
on  the  ground  each  listener  had  an  unobstructed  I  SO 
pathway  between  his  ears  and  the  talker’s  mouth.  t  60 
By  sitting  cross-legged  it  was  easy  to  maintain  a§ 
rigid  and  accurate  listening  angle.  1 

In  both  experiments  listening  angle  was  varied  I  jo 
such  that  listening  angles  of  either  0°;  or  0°,  15°,f 
3G°,  45°,  60°,  75°,  and  90°  resulted  for  the  seven  I  ^ 
radials  from  left  tc  right,  respectively. 

The  talker’s  angle,  the  angle  the  talker  faced  with 
respect  to  the  listener  position,  was  varied  in  ex¬ 
periment  I  but  was  constant  in  experiment  II. 


the  talker.  The  fact  that  better  scores  were  obtained 
on  experiment  II  than  on  I  could  be  due  to  sampling 
error  but  more  probably  the  different  scores  are 
due  to  differences  in  wind  velocity  or  experimental 


design. 


/  /  w  a  "js 
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2J.  Experiment  1 

In  experiment  1,  forty-two  listeners  were  used. 
Each  listener  remained  in  the  same  location 
throughout  all  tests,  although.  as  illustrated 


m 


50  «  20 

Senna  &  as  uvjr- 


Fig.  1.  each  rotated  systematically  between  two  dif¬ 
ferent  facings. 

The  talker  angle  was  varied  as  detailed  in  Fig.  1. 

Experiment  I  was  conducted  in  the  afternoon  and 
the  prevailing  Westerly  wind  interfered  sabstzntia-- 
iy  with  the  intelligibility  of  the  speesh. 


Fig.  2.  On  ihe  left,  per  cent  of  Rhyme  words  correct 
w  distance  from  talker.  Each  of  the  experimen¬ 
tal  points  (solid  circles  and  squares)  represents 
an  average  score  for  7  listeners  and  14  tests  of 
SO  words  each.  All  talker-angle  and  listener 
facing-angle  data  are  combined.  Also  shows 
are  intelligibility  data  vs  distance  from  three 
other  studies.  The  open  symbols  and  defied 
lines  represent  intelligibility  scans  predicated 
ccj  a  3  cB  deep  is  kid  tor  each  distance 
doarded.  The  triangles  seacesea  a  6dB  per- 
dhtaacerioardec  drop  is  isteiligridHiy. 
fta  the  right  is  per  cent  words  coated  vs  speecv 
to-aosse  diSereatial  is  dB  £w  Rhysae  and  PB 
words.  This  is  rise  data  used  far  rise  3  and  6dB 
per-dhtance-doehltd  data  pkfid  «a  the  lest. 


2-3  Experiment  II 

In  experiment  II,  a  second  group  o!  49  recruits 
was  seated  in  7  arcs  and  7  radials.  These  49  listen¬ 
ers  were  selected  from  the  60-odd  regular  members 
of  the  available  recruit  company  cm  the  basis  of  rise 
practice  test  scores.  In  order  to  reduce  the  effect  of 
listeners  on  location,  each  listener  was  shifted  dia¬ 
gonally  back  one  arc  and  over  one  radial  after  each 
pair  of  tests.  In  this  manner  each  listener  was  in 
each  arc  and  in  each  radial  for  one  pair  of  tests. 
The  alternation  its  the  listening  angles  between  the 
odd  and  era.  tests  in  each  pair  of  tests  was  executed 
Inst  as  in  the  first  experiment,  and  the  other  testing 
details  were  the  same. 

The  experiment  II  session  was  held  in  the  mint¬ 
ing  before  the  prevailing  winds  could  interfere  and 
the  wind  velocity  was  only  0  to  9  km/h. 

In  experiment  II  the  talker  always  faced  the  cen¬ 
ter  of  the  group,  the  0°  radial. 

3.  Results 

3J  / n’eliip bility  ex  distance 

In  Fig.  2  arc  plotted  the  results  of  experiments  I 
and  II  in  terms  of  words  correct  vs  distance  from 


Concerning  rise  wind:  In  experimental  I  the  easts 
of  wind  averaged  27kmjh  and  peaked  at  33 xm/h. 
b  experiment  II  the  wind  velocity  varied  from  O 
to  9  km/h.  Hates  and  Omwosts  [51  found  that 
when  a  listener  faced  a  33  km/h  wind,  his  thresholds 
for  tones  in  the  frequency  range  of  speech  were 
shifted  by  40  to  45  dB.  By  turning  away  45"'  and 
90°  this  shift  was  reduced  to  30  and  15  dB,  re¬ 
spectively-  For  a  wind  of  183  km/h  they  iccsd  that 
the  threshold  shift  was  only  about  half  as  great. 
Since  the  winds  came  in  gusts,  the  results  of  experi¬ 
ment  I  are  not  as  different  from  II  as  the  Hates 
and  Ccbwckth  data  [3]  might  suggest,  hut  for 
oserveia  on  the  spot  at  the  time  there  was  no  ques 
tion  hut  that  the  excessive  winds  during  experi¬ 
ment  I  reduced  word  intelligibility  scores. 

In  the  data  reported  by  Xxctssrv  [2].  a  wind  of 
33  to  41  km/h  reduced  the  distance  for  lo%  intel¬ 
ligibility  from  43  m  to  26  tn.  In  the  present  expert 
meals  the  15%,  coverage  was  reduced  from  25  m 
to  about  15  si,  a  redaction  of  the  same  proportion. 
This  would  suggest  that  wind  interference  was  the 
near-exclusive  cause  of  the  general  intelligibility  dif¬ 
ference  between  experiments  I  and  II. 

The  effect  that  the  experimental  design  might 
have  had  cm  depressing  the  scores  in  experiment  I 
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versus  il  will  be  discussed  in  the  section  of  effects 
of  talker  angle. 

Plotted  as  solid  lines  for  extrapolated  dashed 
lines)  in  Pig.  2  are  the  results  of  other  speech  intel¬ 
ligibility  tests  in  open  spaces  reported  by  Challtova 
and  Slavik  [1].  Kxcdsen  [2],  and  Lifshitz  [3].  Ex¬ 
cept  for  experiment  I,  the  windy  one,  the  rate  at 
whidt  intelligibility  fails  off  with  distance  is  fairly 
consistent.  By  utilizing  the  information  plotted  on 
the  right  of  Fig.  2.  it  is  possible  to  check  whether 
the  rate  of  decrease  in  intelligibility  with  distance 
follows  a  decrease  in  sound  pressure  level  of  6dB 
or  3dB  per  distance  doubled.  The  Montague  {6] 
data  concerns  the  relation  between  word  intelligibi¬ 
lity  and  speech-to-noise  ratio  for  Rhyme  words 
fused  in  experiments  I  and  Ii)  and  for  PB  words 
(extensively  used  in  other  tests  of  this  type).  The 
open  symbols  on  the  left  of  Fig.  2  were  derived  from 
the  data  on  the  right  of  Fig.  2  to  show  that  decrease 
in  word  score  per  distance  doubled  is  very  exactly 
predicted  by  successive  decreases  in  speech  level  of 
3dB.  The  triangles  on  the  left  show  that  the  word 
scores  with  distance  doubled  do  not  fall  oS  as  fast 
ats  would  be  predicted  by  successive  drerews  in 
speech  level  of  6dB- 

Has  implies  that  the  physical  Inrs  of  cylindrical 
waves  in  ©pea  space  zre  bras  followed  and  that  the 
physical  laws  of  spherical  waves  are  nol. 

These  results  bear  on  one  of  the  assssp&eas  in¬ 
herent  in  feuesx's  icxmalsthai  ci  Speed:  Inter¬ 
ference  Ixwfe  171.  [8].  The  assumption  is  feat  to 
regain  equal  speech  iatUmbSir  the  acceptable 
noise  kid  must  decrease  6d3  per  distance  dodhied 
for  distances  up  to  4m.  The  results  of  fee  present 
experiments  imply  feat  Ley  end  4n  n.  even  is  open 
air.  fee  acceptable  case  Sr«3  seed  decrease  oely 
3aB-  not  6dB.  per  cjsasace  doubled.  This  statement 
assumes  of  crane  feat  speech  level  is  always  really 
a  speech-to-noise  differential  and  feat  a  decreasing 
speech  kid  is  exactly  equivalent  in  dB  to  an  in- 
arasngnaisrieid. 

3J3  IntrSipbiUly  fs  IcUxr  cagfc 

In  experiment  1  the  data  oa  angle  of  listener 
facing  are  subject  to  large  interactions  involving 
wind  direction-  Those  listeners  who  faced  most 
nearly  into  fee  wind  when  they  faced  fee  talker 
showed  larger  effects  dae  to  differences  in  listener 
fating  than  those  listeners  who  were  not  facing  into 
the  wind  when  they  faced  the  talker.  Experiment  I 
was  not  designed  to  take  account  of  the  wind  and 
fee  listener  angle  data  have  a  wind  bias  feat  cannot 
he  isolated. 

Information  on  talker  angle,  however,  is  not  so 
defers  cert  on  fee  wind  because  fee  talker  rotated 


through  many  angles  and.  as  he  did.  the  relative 
position  between  him  and  the  listeners  was  some¬ 
what  balanced  with  respect  to  wind  direction.  That 
is.  as  the  talker  faced  the  center  of  the  group,  those 
listeners  slightly  to  his  right  faced  the  wind,  but  3s 
the  talker  rotated  to  90  those  slightly  to  his  right 
were  at  right  angles  to  the  wind.  Therefore  the  re¬ 
maining  discussion  of  experiment  I  will  be  limited 
to  the  results  on  talker  angle. 

Fig.  3  shows  a  polar  plot  of  word  intelligibility  as 
a  function  of  talker  angle;  that  is,  the  word  intelli¬ 
gibility  as  a  function  of  where  the  listener  set  with 
respect  to  the  direction  the  talker  faced.  The  direc¬ 
tion  the  listener  faced  with  respect  to  the  talker  is 
not  considered. 


F%-3-  Ecsal  ifiSeSHrSalay  oescs  vs  K|le  foea  as¬ 
ks  fer  Sosspcd  Essratx-farisg  auxirs.  CaSterns 
are  Amea  <rra  a  of  fee  talker 

xsgSe  iuxariss  from  fee  operaacai  I  data 
ia  F%.2-  The  as ashes*  es  fee  periphery  are 
mean  vexes  canect  for  eao  talker  as^Je. 

The  numbers.  ia  parentheses  ca  fee  plot  sfcow  fee 
average  pest  entire  of  words  correctly  beard  by  ail 
fisSexrs  who  sit  or  radix's  feat  were  at  oec  feme  or 
another  from  —  -15°  to  a-225*'  wife  rtsjecJ  to  fee 
talker's  mouth.  These  data  show  feat  listeners  heard 
distinctly  better  vrta  is  front  as  compared  to  be¬ 
hind.  The  hr  value  of  77  a?  — 53“  is  based  on 
only  ore- third  fee  number  of  listeners  as  fee  value 
at  -r-55'';  similarly.  fee  data  a!  —30s1  represents 
©oSy  three- £1  fes  fee  raster  of  cases  as  at  -f30'. 
Tterdojc  a  truer  picture  of  fee  —45°  to  0s  results 
would  probably  be  fee  mirror  image  cf  fee  0°  to 
-r45®  results,  since  fee  positive  rak  results  are 
based  on  fee  results  of  many  more  fcfcaos. 

The  data  are  presented  as  contours  of  equal  is- 
tdSgibiBty  so  feat  fee  results  can  be  more  easily 
compared  So  fee  Chaictova  arid  Sunt  {1}  remits. 
To  draw  equal  intelligibility  contours,  all  fee  data 
at  cafe  talker  angle  were  first  plotted  as  a  function 
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of  distance  (arc).  Curves  were  then  visually  fitted 
to  these  data  points.  The  intersections  of  these  cur¬ 
ves  with  discrete  intelligibility  levels  of.  say.  95,  90. 
85.  80%.  were  then  read  off  and  plotted  in  Fig.  3. 

The  equal  intelligibility  data,  just  as  the  average 
of  all  the  data,  show  that  listeners  in  front  of  the 
talker  got  decidedly  higher  scores  than  listeners  be¬ 
hind  the  talker.  Scores  from  —45°  to  -f  45°  are 
roughly  equal  and  only  slightly  greater  than  those 
out  to  angles  of  75°. 

3  3  Intelligibility  vs  listener  facing  angle 

Experiment  11  was  designed  to  get  reliable  in¬ 
formation  on  angle  of  listener  facing.  A  time  was 
chosen  when  the  wind  velocity  was  low.  the  talker 
always  faced  the  center  of  the  listening  group,  and 
after  each  series  of  two  tests  the  listeners  moved 
diagonally  back  one  arc  and  counterclockwise  one 
radial. 

Equal  intelligibility  contours  from  the  results  of 
experiment  II  are  {dotted  in  Fig.  4.  The  two  lines  at 
each  contour  represent  those  who  always  faced  the 
talker  (the  lower  boundary  of  the  shaded  areal  and 
those  who  faced  away  from  the  talker  by  the  angle 
shown  oh  the  tap  arc-  These  were  the  same  listeners, 
bat  half  die  time  they  faced  the  talker  and  half  the 
tea:  they  faced  normal  to  -he  most  deck  wise  radial. 
When  the  listeners  faced  away  from  the  talker,  their 
scores  consistently  averaged  higher,  or,  as  plotted 
in  Fig.  4.  they  got  the  same  average  score  at  greater 
distances  from  the  after. 

The  mean  intelligibility  scores  for  the  listeners, 
when  their  listening  angles  were  varied,  averaged  87, 
90,-91.  93. 92.  92.  and  91%  for  die  angles  from  0° 
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Zzrixrz  zt  «3=£  cnc&jisSij 

Fie.  4.  Equal  rutrlHgiHlitr  oescas  derived  from  the 
data  ct  qpcihacat  II-  The  talker  always  faced 
the  cesses-  of  the  grasp  so  iafaraaika  co  talker 
is  read  stasaakaBr  amend  the  center 
radial,  frees  O'  threagh  Z  25%  ±3Q=,  and 
±45=.  The  &aatx  iachsg  angle  was  either  0s 
(hnssaea  Ew  of  shaded  area)  or  the  angle 
designated  cn  the  redials  trading  clockwise 
frost  0*  s»  90®. 


to  90°,  respectively,  in  15 1  increments.  This 
amounts  to  an  advantage  for  the  30°.  60',  and  90° 
angles  of  4,  5,  and  -1%.  respectively.  This  compares 
with  an  advantage  of  13.  14,  and  12%.  respectively, 
for  the  same  angles  in  an  investigation  by  Nokolunp 
and  Fhit/ell  [9],  who  worked  in  the  70%  intelligi¬ 
bility  region  with  speech  picked  up  binaurally  via  a 
dummy  head. 

The  results  agree  qualitatively  with  those  of  Ro¬ 
binson  and  Whittle  [10]  in  their  experiments  on 
the  effects  of  angle  of  reception  upon  the  aural  ef¬ 
fectiveness  of  pure  tones.  In  their  binaural  loudness 
tests  they  observed  gains  over  a  similar  angular 
range  a!  2500  and  4000  c/s.  Their  measurements  of 
SPL  at  the  canal  enhance  of  the  ear  also  show  an 
average  gain  at  these  angles,  as  compared  to  the 
case  when  the  source  was  dead  ahead.  Their 
measurements  closely  correspond  to  the  measure¬ 
ments  of  Wieneh  and  Ross  [llj.  whose  mean  45“ 
and  90°  SPL  readings  at  the  entrance  to  the  ear 
canal  averaged  4  or  5dB  better  than  the  0°  SPL 
readings  over  the  frequency  range  from  400  to 
6000c Is. 

4.  Cowjuriwa  to  other  data 

Toe  data  of  experiment  I  and  the  data  on  0“ 
angle  of  listener  facing  of  experiment  II  Have  fcc- 
combined  to  obtain  three  sets  of  cqual-intelligib  .y 
contours,  plotted  in  Fig.  5.  These  contours  represent 
generalized  data  cn  talker  angle.  Also,  on  the  right 
of  Fig.  5  are  the  polar  plot  displays  of  selected  con¬ 
tours  from  Guitpota  and  Suva  [1),  and  Lirhrz 
[3],  and  an  extrapolated  contour  bxsed  on  the  data 
reported  by  Kxusd;  [?].  On  the  left  of  Fig.  5  are 
some  SPL  data  of  Dot  and  Faxsswhjb  !  12]  and 
of  Fuxic.«  (131.  These  data  show  the  directional 
patterns  of  sound  from  a  source  like  a  mouth. 

Concerning  the  intelligibility  contours  shown  on 
the  right,  it  is  evident  that  the  present  data  are  tar 
more  squat  in  shape  than  any  of  the  others.  As  com¬ 
pared  to  Use  other  contour  shapes,  fee  present  data 
are  relatively  fiat  on  tew  (bum  0°  to  -5-45°). 

As  regards  the  depression  directly  in  front  of  the 
talker,  which  was  observed  in  the  Chautota  and 
Sura  data,  this  was  passably  caused  by  inter¬ 
ference  effects  of  the  bodies  of  listeners  between  the 
talker  ami  other  listeners  and  by  the  slight  dis¬ 
advantage  that  exists  for  listeners  who  face  the  tal¬ 
ker  directly  as  compared  to  those  who  face  a!  small 
angles  sway  from  the  talker.  In  the  rectangular  for¬ 
mation  used  in  their  experiment  the  degree  that 
listeners  were  in  the  shadow  of  other  listeners  was 
dependent  both  cn  the  particular  angle  from  the 
talker  and  five  distunre  from  the  talker.  This  would 
cause  angular  anc  distance  effects  on  the  nansmis- 
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sion  of  the  highly  directional  high  frequency  com¬ 
ponents  of  ihe  s-peedt.  This  in  turn  could  have 
systematically  affected  the  reception  of  consonants 
in  suth  a  way  as  to  account  for  a  large  share  of  the 
detail  in  their  equal  intelligibility  contours. 

It  is  quite  evident  that  the  four  studies  do  not 
agree  in  the  magnitude  of  the  percentage  of  cor¬ 
rectly  heard  words  vs  distance.  Since  the  present 
interest  is  in  contour  shape,  this  need  not  be  of  too 
great  concern.  Talker  level,  wind  velocity,  tem¬ 
perature  gradients,  differences  in  vocabulary,  and 
even  differences  in  motivation  could  acount  for  these 
discrepancies. 

On  the  left  it  is  interesting  to  note  that  the  present 
data  agree  very  well  with  the  SPL  of  speech 
measured  in  the  62  to  12,000  c/s  band  by  Dess 
and  Farnsworth  [12].  The  Flanagan  [13]  and  the 
2  to  2.8kc/s  data  of  Dcsx  and  Farnsworth  lie 


about  half-way  between  the  shapes  of  die  present 
data,  and  Lifsihtz,  and  Chalupova  and  Slavic  data 
which  have  steejier  sides  from  about  i  15°  to 
±90°. 

The  Flanagan  curve  in  Fig.  5  is  based  on  his 
SPL  measurements  of  a  2  kc/s  tone  projected 
through  the  mouth  of  a  model  [13],  while  the 
Dunn  and  Farnsworth  curves  [12]  are  based  on 
speech  SPL  measurements  on  a  live  talker.  In  both 
cases  the  measurements  were  made  about  30  cm 
from  the  source.  To  make  up  these  curves  for  Fig.  5, 
relative  SPL  measurements  from  the  plotted  data 
of  the  references  cited  were  converted  to  relative 
distance  on  the  basis  that,  as  distance  is  doubled. 
SPL  is  reduced  by  6  dB,  since  both  studies  showed 
that  at  distances  close  to  the  source  this  is  the  rate 
at  which  sound  pressure  falb  off  with  distance. 
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Fig.  5. 

Equal  intelligibility  contours  of  combined  data 
from  experiments  I  and  II  and  inn  references 
[1],  [2].  and  [3].  Contours  of  equal  soa&d 
press:!  nr  lewd  for  (1)  broad  band  and  narrow 
band  speedi  derived  from  data  in  reference 

[12] ,  and  (2)  tones  from  an  artificial  mouth 
is  a  model  bead  derived  from  data  in  reference 

[13]  are  abo  shewn. 
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